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ABSTRACT 

The relevance of a specific linguistic concept--deep 
structure — to reading comprehension was investigated* The sample 
consisted of 87 fifth-grade students who were of above-average 
intelligence* To study the relationship of childrens skill at 
recovering the deep structure of sentences to reading comprehension, 
the Deep Structure Recovery Test (DSRT) was designed and 
administered* Reading comprehension, measured by a cloze test and the 
reading subtest of the Metropolitan Achievement Test, correlated 
significantly ■ ; .th the DSRT* The DSRT was found to be the most 
important fact ^ in reading comprehension as measured by the cloze 
test when compared to IQ, word knowledge, and word recognition 
skills- The relationship of children's skill at making a lexical 
analysis of the main verb of sentences and reading comprehension was 
measured by a sentence completion teat and the subjects' performance 
on sets of items which contained different types of verbs* It was 
concluded that while the ability to recover the deep. structure is an 
important aspect of reading comprehension, the results did not 
indicate a clear relationship between lexical analysis strategy and 
reading comprehension* Tables and a bibliography are included* 
(Author/DH) 
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ABSTRACT 

This study is concerned with the relevance of a specific 
linguistic concept — deep structure -- to reading comprehension. In 
linguistic theory it is the deep structure which describes the logical 
subject and object of a sentence. The relevance of this concept (deep 
structure) to reading comprehension is first studied by looking at the 
relationship between children's skill at recovering the deep structure 
of sentences and their reading comprehension. Then! one strategy that 
children might use in vecovering deep structure and the relationship 
of this strategy to reading comprehension is studied. This strategy 
— the lexical analysis strategy — involves analysis of the main verb 
of a sentence to determine the deep structures with which it is compatible. 

The sample for the study consisted cf 87 fifth grade students 
from a suburban community. The students were of above average 
intelligence with a mean I.Q. of 117. 

The first hypothesis of the study is* Children's skill at 
recovering the deep structure of sentences is related to reading 
comprehension skill. This hypothesis is evaluated through a test f 
the Deep Structure Recovery Test (D.S.R.T.), which is designed to 
measure Ss' skill at recovering the deep structure of sentences. Each 
item of the D.S.R.T. contains three sentences. In taking the test an 
S is asked to choose the one sentence out of the thre* which is not a 
paraphrase of the other two sentences. In each item two of the sentences 
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have Che same deep structure vhile a third sentence, although super* 



flclally similar Co at least one of Che other two sentences, has a 



different deep structure and thus a different meaning, for example, 



* (a) What the boy vould like is for the girl to leave. 
<b) For the boy to leave is what the girl vould like, 
(c) What the girl vould like is for the boy to leave. 



In this study reading comprehension is measured by a Cloze test, 

It is trgued that a CJoze test is superior to traditional standardized 
comprehension tests. The traditional Metropolitan Achievement i Reading 
Subtest (the M.A.T. Reading test) is also used for comparison purposes. 

I.Q.,, word knowledge and word recognition ability are also measured, 

A correlation and regression analysis produced these results: , 

1. The D.S,R.T, has a substantial and significant correlation, 
i.e* *732, with the Cloze test and a significant correlation 
with the M'A'T. Reading test, 

; ; 2» The D,S,R,T« is the most important factor in reading comprehension 
as measured by the Cloze test, when compared to I.Q,, word 
knowledge and word recognition skill. The D,S.R.T. alone 
accounted for more than 20X of the unique variance in reading 
comprehension* The other variables made little if any unique 
contribution . 



The asterisk indicates the sentence with the different deep structure* 
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3, Word knowledge is the most important factor in reading 
comprehension as measured by the H.A.T . Reading test. 

However neither word knowledge nor any of the other variables 
contributed much unique variance. 

The second hypothesis of thiL study iss Children^ skill at 
making a lexical analysis of the main verb of sentences is positively 
related to reading comprehension skill. Ss 1 skill is measured in two 
ways, first by a sentence completion test and second by Ss' performance 
on sets of items (included in the D.S.R.T.) which contained different 
types of verbs. Reading comprehension again is measured by a Cloze 
test and the M.A.T* Reading test. While the results of the analysis of 
the Sentence Completion test support the hypothesis, the results of 
the analysis of the items on the D.S.R*T. do not. 

Inshort, it appears that Ss 1 ability to recover the deep 
structure of sentences is an important aspect of reading comprehension. 

It is not clear, however* that the lexical analysis strategy is related 
to reading comprehension. 
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And so to completely analyze what we do when we read would 
almost be the acme of a psychologist's achievements » for it 
would be to describe very many of the most intricate workings 

♦ 

of the human mind, as well as to unravel the tangled story 
of the most remarkable specific performance that civilization 
has learned in all its history. 

Edmv * Burke Huey 

THE iSYCHOLOGY AND PEDAGOGY OF READING 
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As Thorndike pointed out in 1917 reading comprehension is 
n ...a very complex procedure involving a weighing of each of many 
elements In a sentence, their organization in proper relation to 
one another, the selection of certain elements of their connotations 
and the rejection of others and the cooperation of many forces to 
produce the final response [1917, p.323), 11 Unfortunately, in spite 
of the vast literature produced on the topic of reading comprehension 
over the past 50 years, thi' , *ote almost completely exhausts the 
accumulated knowledge of this fundamental intellectual process. 

Despite this voluminous research the workings of the mind during 
reading comprehension remain a great and profound mystery. Conse- 
quently* the Instructional procedures and materials used when teaching 
children to comprehend what they read are based upon the intuitions 
and accumulated {experience of reading specialists, not on research 
evidence. 

The major reason, beyond methodological problems, for the inade- 
quacy of paat research has been a lack of basic knowledge of the 
psychological processes Involved in reading comprehension* If 
instruction in reading comprehension is to be based on research 
evidence as veil as the intuitions and experience of reading specialists, 
it is l*portr»nt that fruitful research shed some light upon the psycho- 
logical processes Involved in reading. Knowledge of these processes 



13 



2 . 



can then provide the basis for instruction. 

It is the purpose of this thesis to suggest a new approach to 
the study of the reading comprehension process and to report an em- 
pirical Investigation of the author's which has been motivated by 
this new approach. Fundamental to this new approach is the need to 
base comprehension research on theory. The linguistic theory developed 
by Chomsky (1957, 1965) is the one on which this thesis is based. 

The study reported In this thesis is an investigation of the relation- 
ship of one aspect this linguistic theory to the reading comprehension 
process. 

The paper is organized into six chapters. Chapter One offers 
& critique of past research and a discussion of the direction that 
research. In the author's opinion, must take if it is to illuminate 
the processes basic to reading comp rehuns ion. Also In this chapter 
the rationale for basing this study on linguistic theory is developed* 

The hypotheses and assumptions of the study are presented in Chapter 
Two* Chapter Three Is a description of the measurement of the variables 
used in the study* The sample studied and the testing procedures are 
described in Chapter Fou* . The results and the analysis of the data 
are contained in Chapter Five. And finally in Chapter Six the implications 
of the study are discussed. 
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CHAPTER I 

Research in Redding Comprehension: The Need for New Perspectives 

A Critique of Seven Major Approaches to Comprehension 

In this chapter the inadequacies of the past research wil. 1 , be 
discussed , and the direction that research should take in order t> 
lead to an understanding of the complex^ process of readi^g^ .corapre- , 
hension will be described. 

The Skills Approach 

Mwiy reading specialists believe that reading comprehension can 
best be described and understood through the enumeration of lists of 
comp rehens ion skills. Therefore, most of the research has been in- 
fluenced directly or indirectly by this belief in a skill© explanation. 
Much of the research has endeavored to measure and verify existing 
skills and discover new ones. A discussion of these skills and the 
problems they raise will help demonstrate why the research has been 
so unproductive. 

A great many lists of skills have been proposed, with some 
lists containing several hundred or more skills. The following is 
a sampling of skills that have been proposed: 

- reading In thought units 

• grasping and assimilating relevant details 

- fusing new and old Ideas 

- thinking about words 
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- seeing relationships between words and thoughts 

- remembering what is read 

- finding the topic sentence 

- making generalisations 

- drawing inferences 

- underlining the key words, in a paragraph 

- predicting outcomes 

--distinguishing fact from opinion 

- recognizing literary devices 

- evaluating the author's competence 

- combining recall with own associations 

- reading to follow directions 

- offering new titles for paragraphs 

- finding the main idea of a paragraph 

Recently some of the lists of skills have been organized into 
taxonomies (Spache, 1962; Cleland, 1965; Wolf et al, 1968; Barrett, 1968). 
Taxonomies are pi med to be superior to simple llst6 because they 
provide systematic organization which Is intended to increase cur under- 
standing of the phenomenon under investigation. Taxonomies are systematic 
in the sense that they organize skills Into major categories and sub- 
categories. For example In the Barrett (1968) taxonomy one major 
category Is Inferential comprehension and the subcategories are inferring 
supporting details , Inferring main ideas, inferring sequences, Inferring 
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comparisons, inferring cause and effect relationships, inferring 
character traits, predicting outcomes and interpreting figurative 
language. 

The advantage of systematic organization, however, is more 
imagined than real. As Clymer (1966) has pointed out "as is true with 
all taxonomies, the orderly presentation of the categories may suggest 
a greater precision than the classification system really possesses 
(p.l9], M Clymer's observation applies to taxonomies of reading skills. 
The analysis that follows applies equally to taxonomies and simple lists 
of skills. 

The skills approach to an explanation of reading comprehension 
suffers from a basic confusicn over what domains of behavior and cog- 
nitive activity can rightfully be characterized as reading comprehension 
This confusion has led to the inclusion of reading comprehension skills 
that tend to be global *nd vague and that fail to distinguish between 
those processes that are specific to reading and those that are very 
general cognitive processes. 

The lists and taxonomies of comprehension skills fail to clearly 
distinguish between general mental abilities and those skills specifi 
to reading. For example, the reading skill of "making generalizations" 
Is a skill common to most cognitive processes and hardly specific to 
reading* In designating such of the thinking skills as reading compre- 
hension skills the leading specialists are now obliged to explain the 
thinking process in order to include the thinking skills in reading 
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comprehension instruction. To say that this is a formidable task 
is an understatement since very little is presently known about 
thinking processes. 

Another problem with lists and taxonomies of comprehension 
skills is that they fail to make the distinction between how some- 
thing is comprehended and what is comprehended. As a result they 
fail to define adequately either how something is comprehended or 
what is comprehended. For example, in the skill of "fusing new and 
old ideas" what is learned is "the ideas" and the process is the fusion 
of these ideas. In order for the description of this skill to be 
meaningful, it is necessary to provide an adequate theory of what 
ideas are as well as a specific description of the fusion process, 
Neither the adequate theory of ideas or a specific description of the 
fusion process are presently available. This case is typical of the 
status of many comprehension skills. It appears that a description 
of what is comprehended is necessary before attempting to describe 
the process of how something is comprehended. So that in the skill 
of "fusing new and old ideas" a theory of ideas must be developed 
before research into the "fusion" process can begin. It lcoks like 
the place to begin in comprehension research is with theories of what 
is comprehended. The recent work in linguistic theory (Chomsky, 1957, 
1965) offers Just such an opportunity. This point which has motivated 
the present study will be expanded upor later. 

Another problem with mental skills as explanations of reading 
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comprehension is that they have been used in the literature to describe 
three distinguishably different types of activities as follows: 

1. High level cognitive processes, i.e., processes that involve 

very sophisticated thinking abilities which go beyond a literal under- 
standing of the reading matter* These high level cognitive skills 
include: "distinguishing fact from opinion", "recognizing literary 

devices", "evaluating the author*s competence", etc. 

2. The psychological processes involved in reading comprehension, 
"fusing new and old ideas", "reading in thought units", "combining recall 
with own associations." 

3. The procedures for teaching comprehension e.g., "finding the 
topic sentence", "offering new title for paragraphs", etc. 

Unfortunately, reading specialists have often failed to distinguish 
between these three categories in their discussion of reading compre- 
hension skills. Consequently, there has been a great deal of confusion^ 
as reading specialists attempt to evaluate the importance of the various 
skills and develop ways of teaching these skills to children. This is 
the case because in attempting to evaluate the Importance of the pro- 
posed skills and in developing techniques for teaching these skills, 
there are different considerations to be taken into account depending 
upon which of the above category of skills one is talking about. This 




Some skills fall iito more than one category depending upon how they are 
Interpreted. For example, the skill "thinking about words" could be 
either a high level cognitive skill or a process skill. The fact that 
some skills have more than one meaning, of course only adds to the 
confuslocu 
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point will become clearer as each category of skill is discussed in turn. 

The first category is high level cognitive processes. In eval- 
uating the Importance of these skills the most important consideration 
is whether enough is understood about the mature reader’s skill to 
teach them to children. This will involve an understanding as dis- 
cussed above of both the objects of the skill as well as the process 
employed in using the skill. Very few high level cognitive skills 
have been analyzed in light of his consideration. However, Schell 
(1967) has analyzed the high level cognitive skill of "distinguishing 
fact from opinion" along these lines. He shows first that the task 
of actually distinguishing fact from opinion is a very difficult task 
much of the time, because the distinction between fact and opinion 
is so ill-defined that in all but the most obvious cases this distinction 
is hard to draw. 

In addition, in vhe cases where this distinction could be drawn 
the relevant information for drawing this distinction is often not 
contained in the reading matter. Thus, in the case of the comprehension 
skill of "distinguishing fact from opinion" it appears that we do not 
know very much about the objects of this skill, i.e., facts and opinions, 
nor about the processes employed in applying this skill. Consequently 
it is not surprising that Schell (1967) finds that "there are no handy 
rules of thumb ve can give students to help them in this task (p.9]." 

The rule of thumb now used for teaching children how to decide when 
something is an opinion is to look for qualifiers, e.g., seems, appears, 
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nay, probably, or Indicators, c.g., I think, we believe, our conclusion, 
etc. This rule of thumb is of questionable validity because authors 
commonly omit qualifiers and indicators in expressing opinions. Thus 
the lack of knowledge about the object and the processes of comprehension 
reduces the teaching of this skill to superficial techniques of question- 
able validity. Examination of other higher level cognitive skills 
reveals more of the same. 

The second category of skill presumes to describe the psychological 
processes employed in comprehension. Since, as will be argued in this 
chapter, description and explanation of the psychological processes 
involved in reading comprehension are a crucial prerequisite to improving 
instruction, the validity of these process skills is of utmost importance. 
The question to be answered in assessing the validity of process skills 
is, how good a description and explanation do these skills provide of 
the comprehension process? In examining the process skills one is forced 
to conclude that the answer to this question is that the process skills 
proposed In the literature do not describe and explain the comprehension 
process at all. Such skills as "reading in thought units/' "fusing new 
and old ideas/' "grasping and assimilating relevant details" hardly 
qualify as precise descriptions of the comprehension process. They 
are merely metaphors and as auch do not lend themselves to scientific 
theory construction or empirical research. These skills, or rather 
the present descriptions of the 1 !!, only raise a lot of questions without 
providing any answers. Wardhaugh (1969) in talking about the fusion 
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metaphor gives a succinct statement of some of the problems raised by 
this type of vague description. 

...the fusion metaphor itself is a poor and unrevealing one in that 
it short circuits the process of explaining exactly what happens 
when someone understands a sentence. The concern should be with exactly 
what factors are involved in fusion and why fusion occurs on some 
occasions but not .a others [p.86J. 

For the process skills to be meaningful as descriptions of the 
comprehension process it is necessary first to know more about the 
objects of these skills, i.e., ideas, thoughts, details, etc. In 
addition, it is necessary to provide an explicit description of the 
fusion, assimilation, and grasping process, etc., and the conditions 
under which they are utilized. Only then can skills such as rT fusing 
new and old ideas" have any significance a* descriptions of the 
comprehension process. 

In many lists and taxonomies ona finds comprehension skills 
such as "underlining the key words iu a paragraph," "finding the 
topic sentence," "offering new titles for paragraphs" etc. This group 
of "skills* 1 has caused some confusion because they are quite clearly 
teaching procedures to be used to Improve comprehension rather than 
descriptions of the comprehension process. Sometimes, however, these 
skills are used as testing procedure? to demonstrate that comprehension 
has taken place. In any case they sre really a separate category and 
should not be listed as comprehension skills at all. 

It appears that simply because they were listed along with 
comprehension skills, they are often treated as ends in themselves. 

O 
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Thus teachers dogmatically insist that students must learn to underline 
the key words in a paragraph etc., if they can’t learn to do this they 
are considered to have poor comprehension skills. But since the "skills" 
are means rather than ends their validity and importance depends upon 
how successful these means (i.e., teaching procedures) are for achieving 
the desired ends (comprehension) , This of course assumes that compre- 
hension is well understood. As argued above, net very much is known 
about comprehension skills so that there is no real way of evaluating 
these teaching procedures, and they must be considered only tentative 
at the present time. 

To summarize, in the discussion up to this point it has been 
argued the* the setting up of categories of skills approach has not 
shed much light upon reading comprehension because of a basic confusion 
over the precise behavior and cognitive domain of these skills. This 
coni vision has led to global and vague skills which have failed to dis- 
tinguish (a) between reading and thinking, (b) between the objects 
and the processes of compreherision, and (c) between the high level 
cognitive processes, psychological processes, and teaching procedures 
of comprehension. 

The Measurement Approach 

The most popular way of measuring comprehension has been through 
the development of standardized reading comprehension tests. Typically 
these tests have been designed to measure successful performance on 

O 
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tne comprehension skills ditcussed in the previous section. In the 
most common format the student is asked to read a passage and then 
answer a series of questions which require him to select from alternative 
choices the main idea of the passage, or find some details, or select 
the predicted outcome, or select a generalization, etc. 

There is a fundamental problem with traditional reading compre- 
hension tests. Simply stated, it is not at all clear what they are 
measuring. They have been criticized because they may be measuring 
in addition to or in place of comprehension: 

1. a student *s memory skill on test3 where the student is 

not allowed to look back at the passage (Davis, 1962). 

2. a student's ability to comprehend the questions and 
alternative choices in the test. 

3. a student's test taking skill. 

4. a student's familiarity with the content of the passage. 

Kerfoot (1965) has shown that many of the questions on 
comprehension tests can be answered without reading the 
passage. 

5. a student's motivation and attitude. 

6. a student 'd personality. 

7. a student's ability to recognize the words. 

The lack of a theory of the reading comprehension process makes 



it almost impossible to separate the above aspects of a student's 
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functioning from his reading comprehension skill. For this reason 
traditional reading comprehension tests lack construct validity 
(Chronbaeh & Meehl, 1955); that is, no clear relationship has been 
established between the test and. the underlying concept — reading 
comprehension. Until theories of reading comprehension are developed, 
tests with construct validity will be impossible. 

In recent years a newer and better method of measuring reading 
comprehension has been developed. It is the Cloze procedure (Taylor, 
1953). On a Cloze test every nth word of a passage is deleted and a 
student takes the test by filling in the blanks where the words were 
deleted. The Cloze test is a better measure of reading comprehension 
than traditional tests because it is measuring reading comprehension, 
i*e. it correlates with traditional comprehension tests and at the same 
time it appears to be measuring fewer of the extraneous aspects of 
student functioning. Specifically, it does not have questions and 
therefore is not measuring a student’s skill in understanding questions. 
It is not a memory test because the student cat! continually re-examine 
the passage. It also does not appear to be measuring a student's 
familiarity with the content of the passage, at least to the degree 
that traditional tests do. The mechanical procedure for developing 
the test (the deletion of every nth word) renders it more objective 
and less subject to the arbitrary judgments of the test constructor 
than traditional comprehension tests. In addition, a great deal of 
research has gone into the Cloze test and much is known about its 
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operating characteristics (see Tremont, 1967 ; Potter, 1968), Obviously 
che Cloze vest l^cks construct validity for the same reason that all 
tests of the reading comprehension process do, that is, the absence 
of a theory of the process* 

In summary, traditional and Cloze tests both suffer from a lack 
of construct validity. However the Cloze test is a better measure 
of comprehension because in measuring reading comprehension it appears 
to be measuring fewer extraneous aspects of cognitive functioning than 
traditional tests do. 

The Factor Analytic , Appr oach 

Factor analysis is another technique that has been employed in 
an atteapt to understand reading comprehension. This approach has 
been closely related to skills and their measurement by standardized 
tests because factor analysis has most often been applied to standardizes 
tests* 

There have been a lumber of factor analytic studies of reading 
comprehension (Davis, 1944, 1968; Thurstone, 1946; Vernon, 1957; Hall 
& Robinson, 1945; Anderson, 1949; Langsam, 1941; Gans, 1940; Holmes 6 
Singer, 1966), Since the results of a factor analysis are in part 
determined by the age of the subjects, the tests used, the mathematical 
solution employed and the labels applied to the factors, and since 
these varied from study to study, it is not surprising that ve find 
some very disparate factors such as seeing relationships, word meaning, 

number factors, chsrt reading skills etc., In these studies. 

O 
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In general, these studies found that reading comprehension was 
composed of the same skills that reading comprehension tests measure — 
not an earth-shaking discovery. In examining the factor analytic 
studies for the factors that showed up in a number of studies, Spache 
• (Mf& IXound tk> * /actors in reading comprehension; a word meaning 

lactor, an idea relaurunship factor, a reasoning factor. But these 
factors tell us very little because they are no more than metaphorical 
descriptions of the reading comprehension process — the same problem 
that existed before these studies* 

Davis (1944) points out a requirement that factor analytic 

studies of comprehension must meet in order to be meaningful: 

The most important s-*p in a study that employs factorial procedures 
for the investigation of reading comprehension is the selection of 
the tests the scores of which are to be factored. Unless these tests 
provide measures of the most important mental skills that have to be 
performed during the process of reading, the application of the most 
rigorous statistical procedure cannot yield meaningful or significant 
results. The importance of this point can hardly be overestimated [p«3]« 

Clearly, comprehension terts do not meet the requirement that they 
measure "the most important mental skills that have to be performed 
during the process of reading," because the exact nature of these 
'•"“mental skills" is ; r»t yet known* The tssk for reading researchers 
is to discover and describe these le mental skills" in a very precise 
vay* Only then can tests be developed to measure these skills and 
then factor analysis may become a useful tool in looking at the 
relationships between these '‘mental skills," 
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The Correlational Approach 



Another widely used approach to understanding reading comprehension 
has been through the search for correlates of the comprehension process*. 
A number of studies, employing standardized tests as measures of compre- 
hension and correlational or group comparison research designs, have 
been conducted to determine what variables are related to the compre- 
hension process. These studies have found such correlates of compre- 
hension a si 

1« Social class (Chandler 9 1966) 

2. Face (Cooper, 1964) 

3. Sex (Gates, 1961) 

4« Personality (Vehar, 1962; Neville, Pfost & Dobbs, 1967) 

5. Attitude (Jacobson 6 Johnson, 1967) 

6* Physical growth (Olson, 1940) 

7* Intellectual ability (Karootamian, 1966; Bleismer, 1954) 

8. Perceptual skill lOlson, 1966) 

9* Rate of reading (Tinker, 1939; Carlson, 1949) 

10. Oral language skill (Tatham, 1967; Loban, 1966) 

11« Listening (Duker, 1965) 

These studies and others like them raise the same problems. Most, 
if not oil, of these studies employed standardized reading tests as 
measures of their dependent variable, i.e., reading comprehension, 
thus they ljso facto define reading comprehension in terns of the skills 
these teste purport to measure. However, as has been pointed out, the 
skills an d their measurement are of questionable validity, thus the 
relationships found by these studies between the above variables and 
reading comprehension are difficult to interpret, because the dependent 
variable of the studies is of questionable validity. 
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It Is difficult If not Impossible to determine what the factors 
enumerated above correlate with, since It Is not clear what compre- 
hension tests are Pleasuring. Thus I.Q., race, sex, etc. may be 
correlated not with the reading comprehension process but v/ilh the 
ability to answer questions, test taking skill, motivation and the 
like. 

But more importantly, even if the relationships found in these 
studies prove to be correct, how much does this add to our knowledge 
of reading comprehension? It clearly is of some value to know that 
intellectual ability, language ability, perceptual ability, etc., 
are related to reading comprehension. But the question of how and in 
what ways these variables are related to comprehension remains un- 
answered. What do these relationships indicate about the comprehension 
process? When more is found out about the comprehension process, 
then the findings of these studies will be of more value. 

In a sense these studies provide some of the facts that a theory 

of comprehension must explain. However enough of these facts have 

been gathered for the present time. As Wardhaugh (1969) puts it: 

...these factors multiply faster than systias for handling them, 
so that ve are very far at the moment from an acceptable theory 
of reading. This situation will continue to exist as long as 
researchers allow themselves to be sidetracked from the content 
of readlug into its correlates [p.4]. 

The Readability Approach 

Readability research is another area that is closely related to 
searching for correlates of the comprehension process. Looking at it 
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from the point of view of trying to understand comprehension, readability 
research is the search for those characteristics of written matter that 
are correlates of reading comprehension. Chall (1958) presents an 
exhaustive review of the work in readability. More recent developments 
can be found in Klare (1963) and Bormuth (1968). This research has 
been productive because it has had a direct influence cn Instruction. 
Publishers and teachers have used these formulas extensively in 
developing and editing reading materials for children. Thus readability 
research has had practical consequences and this in and of itself 
justifies the research. 

But from the point of view of understanding reading comprehension 
the question to ask is; how much light has readability research shed 

* 

upon reading comprehension? The answer is some but not much. 

Since readability research has tended to use comprehension tests 
that measure the same skills that have been discussed above, the character- 
istics of written matter that have been found in these studied to correlate 

with comprehension difficulty can be questioned because the measurement 

2 

of the dependent variables of these studies lack construct validity. 

Even if it is assumed that the dependent variables of these studies 
are accurately measured by comprehension tests, the question to be 
amivered is j vhat properties of printed matter account for comprehension 



The nore recent readability studies have used the Cloze test to measure 
comprehension. As has been pointed out the Cloze test is superior to the 
traditional measures in ways already discussed. It does, however, still 
ilack construct validity. 




30 



*^* 5 ;: **.r^'?^*€& ** &r!*>*r*r y^ow «■ ■* < 



19. 

difficulty and how informative are these properties? Chall (1958) 
has suggested that there are four major factors that account for 
comprehension difficulty: (a) vocabulary load, (b) sentence structure, 

(c) idea density and (d) human interest. Idea density and human interest 
are so global and fuzzy that they haven't been measured very success- 
fully. Vocabulary is measured most accurately by the number of words 
in a passage not on a given list of frequent words, and sentence 
structure is most accurately measured by some measure of sentence length* 
These factors appeal to be approximate measures of some underlying 
variables that are intrinsic to the comprehension process. In order 
for these variables to help in understanding reading comprehension 
the processes underlying them must be explained. An understanding 
of the reading comprehension process will then answer questions such 
as: What is it about sentence structure and vocabulary load that 

Influences comprehension difficulty? In what ways do sentence structure 
and vocabulary influence comprehension? What are idea density and 
human lute rest and how can they be described more precisely etc.? 

The Introspective Approach 

Another approach to understanding comprehension has been the 
study of readers' introspective reports immediately after reading 
(Cafone, 1966; Piekarz, 1968; Jenkinson, 1957). This type of study 
is admirable in that it proposes to take a close look at the comprehension 
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process and the mental activities involved in reading. 

However* introspective studies suffer an important methodological 
limitation in the case of reading comprehension. Since reading is a 
covert and complex mental process which takes place at great speed, 
it does not appear to be open to introspection. If one thinks about 
reading when one is reading, then the comprehension process is turned 
off, so that one can't describe the process during reading because 
of an Inability to pay attention to two things at once. It is not at 
all clear what the reports of subjects in these studies are descriptions 
of. There are so many factors such as memory, thinking, personality, 
motivation, etc*, other than the reading comprehension process that 
could account for these descriptions that the relationship between 
these descriptions and the comprehension process is questionable* 

In order for these studies to bear fruit it must first be shown that 
the descriptions that readers produce of their own reading bears a 
close relationship to their actual reading process* And given the 
speed, complexity and covert nature of the comprehension process the 
likelihood that these descriptions bear any close relationship to the 
reading comprehension process seems remote to this investigator. 

The Models Approach 

A recent approach to understanding the comprehension process 
has been the construction of theoretical models of the process (Spache, 1962 
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Smith, 1960; Kingston, 1961; Holmes & Singer, 1966. As Clymer (1968) 
puts It: 

Constructing a model forces the investigator to organize facts and 
to set them against a rational framework; at the same time, it 
provides a technique for testing these facts and for generating more 
hypotheses for testings 

It seems clear that models are one fruitful way of understanding 
the comprehension process. A model or reading comprehension can generate 
testable hypotheses which can be verified through empirical research 
and can ultimately lead to an adequate theory of the process. 

Ihe question that must be &sked about the proposed models of 
the comprehension process is; how much light do they shed upon the 
comprehension process? Unfortunately, the promise of the model's 
approach to reading comprehension has not been fulfilled. The models 
cited above of reading comprehension fall short because: 

1. The components and categories of these model 3 tend to be 
vague and global. The major terms are undefined. They look very 
much like the same old comprehension skills. Thus we find such model 
components as "recognition of a sentence as a complete thought," 
"comprehend main idea as extension of the topic sentence" (Spache, 

1962) and "main idea as implied or as topic sentence" (Smith, 1962). 

The problems with these types of statements whether as skills or as 
components of models should be obvious. 

2. There is a confusion of psychological and neurological 
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explanation and the connection between neurological and psychological 

terms is not specified. We find such statements as: 

...reverberatory activity, the continuous firing of a neuron circuit 
or network over a period of time. This activity appears to account 
for ability to comprehend sentences [Smith, 1962, p.24]. 

Substrata factors are thought of as neurological memory systems 
composed of smaller subsystems of the brain containing various kinds 
of information, such as audio, visual and kinesthetic associations 
which in a cultural milieu bestow a sense of reality upon symbolically 
represented thought units [Holmes & Singer, 1966, p.3]. 

These statements .sound more sophisticated than "finding the 
main idea of a paragraph" but the use of technical language suggests 
more precise knowledge than is possessed. It must be shown exactly 
how a "reverberatory circuit" 'ccounts for sentence comprehension, 
and how "neurological" sensory systems "...bestow a sense of reality 
upon symbolically represented thought units," and what is the set 
of symbols that represent thought units in the brain etc. Fodor 
(1968) has argued that adequate psychological theories are prerequisites 
to neurological explanations of psychological phenomena. 

3. None of the models is based upon an adequate description 

of the objects of the comprehension process i.e., a theory of language. 

4. Some of the models don't seem to generate any testable hypotheses 
Sparks and Mltzel (1966) have criticized the Holmes and Singer (1966) 
model on this basis. 

The models approach appears worth pursuing but better and more 
sophisticated models are needed. Models based upon a more careful 
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look at language and the range of phenomena to be explained, and 
models with a better idea of the requirements a model must meet in 
order to provide insight into a phenomena, are needed before this 
approach can bear fruit. 

Conclusion? Approaches to Comprehension 

Taken all together the great deal of effort expended in the 
various approaches to comprehension have produced very little knowledge 
of reading comprehension that is commensurate with the effort expended, 

As Spache (1962) points out we still do not know "...1) exactly what 
thinking processes operate in comprehension, 2) how may the reader^ 
facility in each of these processes be measured, and 3) how can ability 
in these processes be improved in instruction [ p . 63] ? M The accumulated 
knowledge of the comprehension process does not go much beyond Thorndike 1 $ 
description quoted at the beginning of this chapter. 

Comprehension Research and Peda g ogy 

The assumption underlying most educational research is that the 
effectiveness and therefore validity of pedagogical techniques is at 
least in* part dependent upon the extent to which techniques capitalize 
upon the actual psychological processes that students utilize in learning. 
All other things being equal, the more effectively the pedagogical 
technique takes advantage of the student's actual learning process 
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the more effectively the student will learn. 

In the case of reading comprehension, since very little is 
known ebout the comprehension process, the techniques for teaching 
comprehension are of unknown validity. 

As a result of this lack of knowledge of the process, the 
techniques for teaching comprehension tend to be global and superficial 
without any real methods of focusing upon specific areas of student 
difficulty. In much of the material for teaching comprehension the 
student is presented a passage to reaa and then asked a series of 
questions on what he has read. Tiese questions are stated in terms 
of the previously discussed skills, i.e. find the main idea, recall 
a detail, etc. If the student answers the question correctly, he 
then repeats the procedure on a different and more difficult passage. 

If he selrxts a wrong answer, he repeats the process on another passage 
of equal or lower difficulty. Ihere is no attempt when the student 
makes an error to show him why he made the error or how to go about 
finding the correct answer. 

In general when there is a lack of good description of psychological 
processes, the procedures for measuring performance are often used as 
a model for pedagogy. Consequently teaching procedures do not go very 



If this assumption is incorrect then there is no reason to base teaching 
techniques on the type of educational and psychological research that 
aims to shed Hght on basic learning processes. And consequently this 
research would have no practical instructional implications. 
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much beyond duplicating testing procedures* There is nothing in 
this procedure which allows for the correction of errors. For example 
the workbooks used in teaching vocabulary contain exercises that are 
almost identical to items ^n vocabulary tests. A further example is 
the use of the Cloze procedure to teach comprehension. 

In some cases there are attempts to go beyond testing procedures 
and to provide students with rules of thunb as principles. However these 
usually prove to be superficial and weak. For example in the case 
mentioned earlier of the skill of "distinguishing fact from opinion" 
the rule of thumb that is given to the students is a very superficial 
rule that doesn’t hold particularly well (Schell, 196?). 

Thus the pedagogical techniques for teaching comprehension leave 
a great deal to be desired. They boil down co practice in reading and 
testing and in some cases to superficial rules of thu b that don’t seem 
to work very well. This crude superficiality of the techniques for 
teaching comprehension is all that can be expected when so little is 
known about the comprehension process. In fact it Is difficult to 
justify all the voLiraJious rerearch in reading comprehension if the 
only practical results, beyond readability formulas, are the present 
techniques for teaching comprehension. Indeed the meagre results of 
this line of research are the bert argument for a new approach, one 
yielding increased knowledge of the comprehension process. This type 
of research t.ujld produce improved techniques for teaching comprehension. 
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In the next section some important reasons vhy the comprehension 
process has been resistant to explanation will be considered. These 
reasons will help in deciding upon the direction new work in comprehension 
should take in order to be more productive than past research. 

Why th e Comprehension Process has been Resi s tant to Explanation 

In searching for the reason for the lack of success of past 
reseai^h efforts in comprehension, it would be misguided to place 
the blame upon the poor methodology thct this research has exhibited. 

It could thus be assumed that methodologically more sophisticated 
research would produce meaningful results. While it is true that many 
of tne past stndies, as in any field of research, have had severe 
methodological difficulties (Cleland, 1964), improving methodology 
alone will not produce major breakthroughs in knowledge of the reading 
comprehension process, as the methodologically improved research over 
the past decade or so demonstrates. 

The reason for the lack of progress in comprehension research 
goes much deeper than poor research methodology. The current lack 
of descriptions of the mental processes involved in reading comprehension 
render it difficult if not impossible to establish adequate behavioral 
criteria for successful comprehension. This in turn is due to the 
fact that comprehension is a complex and covert mental process which 
along with other cognitive processes takes place without any overt 
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behavior being produced. Thus the behavior that is measured in research, 
i.e. answers to questions on comprehensicn tests, nay be only indirectly 
if at all related to the comprehension process. Furthermore, it. is 
difficult to distinguish, in anything but an arbitrary way, between 
behavior that reflects the comprehension process frc™ behavior that 
reflects other psychological processes — such as motivation, memory, 
attitude, attention, personality, etc. To put the problem simply, 
it is almost impossible to conduct fruitful empirical research ^?hen 
there is a lack of knowledge of which behaviors provide relevant measures 
of the process under investigation. 

Conprehension test developers have not really faced up to this 
problem. As a consequence there has been a great proliferation of 
comprehension tests measuring many different behaviors with no agree- 
man i among tests as to what the relevant behaviors axe or the best 
ways to measure them.^ Fodor (1968) points out this situation as 



4 

It is true that eye movements and pupil activity can be observed during 
comprehension. But these phenomena are difficult to interpret. Also, 
there are no doubt neurological changes that take place during compre- 
hension, but these are not easily monitored. Even when they can be 
monitored they also have resisted interpretation. The reason, as Fodor 
(1968) points out is that neurological explanations of oehavioy are 
dependent upon adequate psychological explanations which of course 
are lacking in the case of comprehension. 

**The difficulty in defining the appropriate behavior as a criterion for 
comprehension can be contrasted to the word recognition process in which 
there is a clear cut and easily definable behavioral criterion for 
succeasful performance i.e., correct oral pronunciation. For this reason 
word recognition appears to be. more amenable to empirical research. 
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typical of disciplines weak in theory. Spache (1962) describes the 
situation accurately: 

Test makers are prone to define comprehension in terms of the types 
of questions included in their instruments. If six types of questions 
follow each reading selection, it is assumed that six aspects of compre- 
hension or six types of thinking are being measured [p.61], 

This problem is not unique to psychology, Kuhn (1962) points out, 

it is characteristic of the early stages of scientific development: 

In the absence of a paradigm (theory) , all of the facts chat could 
possibly pertain to the development of a given science are likely to 
seem equally relevant. As a result, early fact gathering is far more 
a nearly random activity than the one that subsequent scientific 
development makes familiar [p.ISJ. (Parentheses mine) 

That comprehension research is at this "early fact gathering" 
stage with the concomitant Inability to separate relevant from irrelevant 
facts is evidenced by the proliferation of comprehension , skills and 
tests to measure them and the inability to distinguish comprehension 
from other psychological processes. The problem is not that more 
facts are needed but that it is impossible at present to give a coherent 
explanation of the facts that already exist. ^ 



The Need to Base Comprehension Research on Theory 



Clearly an important step that must be taken to help remedy the 
state of affairs that exists in comprehension research is to base 



The point of scientific inquiry is to provide through theories an 
explanation of phenomena. Facts or data are important only as they 
bear upon the truth of falsity of theory. They only make sense in 
light of theory. In the absence of theory facts are of minor importance 
to scientific inquiry. 
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future research on theory. Theory based research has the advantage 
of helping to provide, a principled way of separating relevant from 
irrelevant facts, of determining appropriate behavioral criteria, 
and suggesting important hypotheses that can be subjected to empirical 
research. Theory allows empirical research to be conducted on a 
rational and systematic basis. 

Unfortunately the term "theory 11 has been used rather loosely by 
reading researchers and some models and taxonomies have been mistakenly 
called theories. An examination of the criteria for scientific theories 
(Schutz, 1962) will quickly convince the reader that the term theory 
does not apply to most models and taxonomies,. 

Two Types of Theorie-st Competence and Performance 

There are two types of theories that are relevant to the study of 
psychological processes such as reading comprehension. These are 
theories of competence and theories of performance, (Fodor and Garrett, 
1966) , A competence theory is a theory of what someone knows while 
a performance theory is a theory of how someone uses this knowledge. 

In the case of reading comprehension a competence theory describes 
what is comprehended, i,e, the language and thoughts and ideas, while 
a performance theory describes how the comprehension process works. 

In other words a competence theory is a description of the object of 
the process while a performance theory describes the actual process, 
l,e, mental manipulations. 




41 



30 . 



An example will help clarify the competence/performance distinction. 
In the reading skill of learning to recognize words, the object of the 
process, i.e. what it is that is learned, is the relationship between 
printed words and the pronunciation of these words, A competence 
theory would in effect be a description of the relationship between 
the English spelling and sound system. The actual psychological 
processes that readers employ in decoding English spellings into 
correct pronunciations is described by performance theory. > 

The Need to Limit the Scope o f Comprehension Research 

It appears to be necessary to limit the scope of comprehension 
research to those aspects of comprehension for which there exist 
adequate competence. theories, i.e, descriptions of the tool employed 
in the process, namely language. This restriction is necessary because 
the development of competence theories is prerequisite to the development 
of performance theories. It is necessary to have a description of what 
it is that is learned before it is possible to conduct fruitful research 
into the learning processes. An example will clarify this point. In 
the putative comprehension skill of "fusing new and old Ideas" it seems 
pretty clear that before empirical study of this skill can be successfully 
undertaken, it will be necessary to define ideas, to develop a perfectly 
explicit method of counting ideas, to develop an unequivocal procedure 
for analyzing text into its component ideas etc. An adequate competence 
theory of ideas would, if it existed, provide this information. This 
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theory would then provide a principled way of counting ideas as 
well as help to distinguish between relevant and irrelevant behavior 
in the development of behavioral criteria for measuring the fusion of 
ideas in comprehension ♦ A competence theory oZ ideas would also 
suggest testable hypotheses about the fusion process which would 
eventually result through empirical research in a performance theory 
of the process by which readers "fuse new and old ideas." Without 
a competence theory of ideas it is almost impossible to know where 
to begin. 

The lack of a theory of ideas has result'? 1 in research in which 
either ideas are so narrowly defined as to lose their resemblance to 
the original concept of an idea or in research in which such global and 
vagde definitions of ideas are employed that the results are virtually 
uninterpretable. In both cases not much light has been shed on the 
nature of ideas or upon the ways in which readers learn to "fuse 
new and old ideas" in comprehending written matter. 

Many of the attempts to understand the comprehension process 
have been less than successful precisely because of the lack of this 
competence theory oi the object of the process upon which to base 
research. Without this competence theory the research has been 
reduced to almost "random fact gathering," 

Furthermore, it should be noted that basing comprehension 
research on available linguistic competence theory means that this 



43 



32 . 



research at present rau?t be limited to attempts at understanding the 
processes involved in comprehending the literal meaning cf sentences. 

This limitation is necessary because linguistic competence theory 
has only advanced to the stage of dealing with the literal meaning 
of sentences, and there is little, if any, theory that deals with 
anything beyond the sentence level. Moreover, the literal interpretation 
of sentences is a good starting point for comprehension research because 
most of the more global comprehension skills discussed earlier assume 
a literal interpretation as a prerequisite. As Huey (1908) points 
out, ''Language begins with the sentence and this is the unit of 
language everywhere [p . 123] . " 

Deep Structure as a De scription o f the Object of the Comprehension Process 

Giv^n that reading comprehension research should be limited to 
those aspects of comprehension for whxch a theory of language currently 
exists, the question to ask is: for vhat aspects of comprehension do 

there exist competence theories? It seems clear that one aspect of 
what is understood in comprehension is the underlying structural relation- 
ships of sentences. For example in the sentences: 

(a) John is eager to please 

(b) John is easy to please 

• The reader must understand that in (a) John is the one who is doing the 
pleasing and the person that is pleased is unspecified, while in (b) the 
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reader must understand that it is John who is pleased and the person 
doing the pleasing is unspecified. This information is necessary 
to understand a sentence. To comprehend a sentence the reader must 
understand the underlying structural relationships i.e., the logical 
subject and logical object of the sentence. Thus one aspect of the 
object of the comprehension process is the underlying structural 
relations of sentences. Fortunately there does exist a competence 
theory that describes these relationships. This theory is generative 
or transformational grammar (henceforth T.G.), as developed by Noam 
Chomsky (1957, 1965). In T.G. it is the deep structure that captures 
these underlying structural relationships. ^ (See Jacobs & Rosenbaum, 
1968; and Langacker, 1967 for readable descriptions of transformational 
grammar). T.G. thus provides the competence theory of the object of 
the; comprehension process upon which the research reported here is 
based. 

But before describing this study it is necessary to ask whether 
or not the deep structure of sentences as described by T.G. is 
psychologically real? In other words is the deep structure sctually 
employed in language comprehension? The studies bearing on this 
question will be discussed in the next section. 



These relationships are deep or underlying in the sense that they 
are not usually given in the surface structure of sentences. This 
can be seen in the example above in which (a) and (b) both have the 
game surface structure but different deep structures. 
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The Psychological Reality of Deep Structure 

A 'number of studies have attempted and have been without 
exception successful in demonstrating the psychological reality of 
deep structure. They have done this by shoving that deep structure 
differences with surface structure held constant are consistently 
reflected in recall* recognition* comprehension and learning of 
sentences, (Bever, Mehler & Carey, 1967; Blumenthal, 1967; Blumenthal 
& Boakes* 1967; Clark* 1969; Davidson 6 Dollinger, 1969; Levin 6 Wanat, 
1967; Levelt, in press; Mehler 6 Carey* 1968; Wanner, 1968), A 
representative sample of these studies is described below, 

Blirmenthal (1967) compared subjects' recall of two types of 
sentences. Recall was aided by a prompt word taken from the sentence. 
Both types of sentences had the same surface structure but they differed 
In their deep structures. The first type was a standard passive* e.g.* 
"Gloves were made by tailors , u In the second type of sentence the 
by-phrase was replaced with a non-agent adverbial by-phrase* e.g. 

"Gloves were made by hand." In the first type "tailors" is the deep 
structure logical subject of the whole sentence* e.g., tailors made 
gloves. In the second sentence "hand" is a verb modifier and does 
not relate to the whole sentence in the deep structure. 

The results of the experiment support the psychological reality 
of deep structure. Blumenthal found that the first type of sentence 
was recalled more easily than the second type. He also found an 
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interaction between prompt words and sentence types. When the initial 
noun was the prompt word, there were no differences in the ability 
of subjects to recall the t^o types of sentences, But when the final 
noun was the prompt word, the standard passive, e.g., "Gloves were 
made by tailors , 11 was nore easily recalled than the second type, e.g., 
"Gloves were made by hand. 11 These results could be predicted by the 
differences in the deep structure relations in the sentence*. In the 
first sentence, "tailors" is the deep structure subject of the whole 
sentence, while in the second sentence, "hand" lo related only to 
a part of the sentence as an adverbial. 

in a subsequent study in which different types of sentences were 
used and a more difficult recall task employed, Blumenlhal and Boakes 

» 

(1967) provided further confirmation of the psychological reality 
of deep structure. As in the previous experiment, they used sentences 
with the same surface structure but with different deep structures, 
e.g., "John is easy to please," and "John is eager to please." In 
the first sentence "John" is the object of the sentence and the adjective 
"easy" is related to the whole sentence as a modifier, i.e. a paraphrase 
of the sentence could read "For someone to please John is easy," In 
the second sentence "John" is the subject of the sentence and the 
adjective "eager" is related to only part of the sentence as a noun 
modifier. The results were the same as in the earlier study. The 
sentence modifier, e.g.^ "easy", when given as a prompt word, produced 
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better recall of its sentence than the noun modifier, e.g,, "eager", 
did for its sentence. 

Further evidence is provided by Levin and Wanat (1967), They 
measured Eye-Voice-Span (the amount that a Ss 1 eyes are ahead of 
his voice when he is reading aloud) of subjects reading passive 
sentences that had the same surface structure but different deep 
structures. Two types of passive sentences were used. One type 
was the standard passive, e.g,, "His brother was beaten up by the 
gang." In the other type the underlying subject cf the sentence was 
deleted, e.g.V "His brother was beaten up by the park." They found 
a larger EVS for the first sentence type than for the second sentence 
type. Thus deep structure differences produce differences in Eye- 
Voice-Span, 

Finally, a study by Bever, Mehler and Carey (1967) demonstrated 
that eye fixation patterns during reading were also influenced by 
differences in surface and deep structures of sentences. They used 
ambiguous sentences, i.e., sentences that can be interpreted in more 
than one vay. Two types of ambiguities were used in this study — 
surface structure and deep structure ambiguity. Surface structure 
ambiguities are those in which ambiguity is contained in the grouping 
of the words of the sentence. An example of surface structure ambiguity 
is "They gave her dog candies," This sentence lias two interpretations; 
They (gave (her dog) candies), i.e., candies were given to her dog, 
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and They (gave her (dog candies)) , i.e, dog candier were given to her. 

A deep structure ambiguity is in the underlying logical relations of 
the sentence. "The shooting of the hunters..." is ambiguous at the 
deep structure level. It has two interpretations: (The shooting (of 

(the hunters))) I.e., the hunters were shooting, and (The shooting (of 
(the hunters))) i.e., somebody wa;; shooting the hunters. Notice that 
while the deep structures differ, the surface structures as denoted by 
the parentheses remain the same. 

The eye fixations of the subjects were recorded as they read 
stories five sentences long in which the ambiguous sentence was the 
fourth one. The story context made one interpretation of the ambiguous 
sentence highly probable. The eye fixation patterns were compared 
for the tvo interpretations of the ambiguous sentence. They found that 
deep structure as well as surface structure differences were reflected 
in fixation patterns. 

These studies taken all together provide ample evidence for the 
psychological reality of deep structure. Thus T.G. provides a psycho- 
logically real description of the object of the comprehension process » 
i.e., the deep structure. This theory provides the basis for the study 
of reading comprehension reported in this dissertation. 
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CHAPTER II 

An Investigation of Deep Structure and Reading Comprehension 
The Purpose of the Study 

The study described in this paper is concerned with the relevance 
of a specific linguistic concept — deep structure — to leading compre- 
hension* Ths study has two concerns: first to investigate the relation- 

ship between children’s skill at recovering the deep structure of 
sentences and their reading comprehension skill, and second to investigate 
one strategy that 3tudents might use in recovering the deep structure 
of sentences and to determine the relationship of this strategy to 
reading comprehension* 

H ypoth e ses and Assumptions of the Study 
Hypothesis is Children's skill at recovering the deep structure of 
sentences is positively related to reading comprehension skill* 

This hypothesis rests upon the following assumptions: 

1« Comprehending a sentence minimally involves recovering the 
deep structure of that sentence* 

2* Understanding sentences is positively related to understanding 
connected discourse. 

An objection could be raised here. It might be argued that since 
recovery of deep structure is almost by definition an aspect of sentence 
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comprehension, then reading comprehension of connected discourse must 
involve recovery of deep structure because reading comprehension 
involves understanding sentences. According to this line of reasoning 
hypothesis 1 is a tautology and bound to be confirmed by the data. It 
is, on the contrary, not at all clear that this hypothesis will produce 
positive results, for the following two reasons: 

1. Recovery of deep structure might be such a fundamental 
skill that children all employ it equally well. Thus 
there would be no individual differences in this skill. 

2. There may be differences in children’s skill at recovering 
deep structure, but this skill is totally submerged by 
other factors such as word knowledge, past experience, 
motivation, attention etc. in accounting for reading 
comprehension. 

For either of these two reasons correct recovery of deep structure 
could play little if any role in reading comprehension and consequently 
hypothesis 1 would not be confirmed. 

Hypothesis 2: Children’s skill in making a lexical analysis of the main 

verb of sentences is positively related to their reading comprehension 
skill. 

This is a strategy that children may in recovering the deep 
structure of sentences l.e. a lexical analysis of main verbs, has been 
suggested by Fodor, Carrett, and Bever (196S). They suggest that 
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•'The exploitation of the lexical analysis of the main verb of a 
sentence is a central heuristic in the strategy Ss use to rt. cover 
its deep structure [p«459]." This strategy involves analysis of the 
main verb of the sentence to determine the possible deep structures 
with which it is compatible. This information is formalized in the 
grammar by the strict subcategorization rules which classify verbs 
according to the deep structure configurations with which they are 
compatible (Chomsky, 1965). For example, some classes of verbs can 
take mbre deep structures than others. Thus, the verb "hit 11 is a 
pure transitive and can only take one deep structure as in (a), 

(a) The boy hit the girl. 

The verb "know' 1 , however, can take two deep structures, a transitive, 
as in the sentence (b) , 

(b) The boy knows the girl, 

and a complement as in the sentence (c) , 

(c) The boy knows that John is smart. 

If the strategy for recovery of the deep structure involves the 
> Information Contained in the strict subcategorization rules, then all 

other things being equal, verbs such as "know" should require more complex 
processing than verbs such as "hit" because the former entertains more 
possible deep structures than the latter. 

Fodor, Garrett, and Bever (1968) have provided evidence in support 
of this strategy by demonstrating that sentences containing verbs that 
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can take either a complement or transitive deep structure, e.g. The 
box the man the child knew carried was empty, are more difficult 
and take longer to paraphrase than the same sentence with knew 
replaced by a main verb that can take only a transitive deep structure 
e.g, The box the man the child met carried was empty. 

Hypothesis 2 rests on assumption 1 and 2 previously stated plus 
the following assumption. 

3. A lexical analysis of the main verb is a part of the strategy 
that is used in the recovery of deep structure. 

In the next chapter the measurement of the variables of these two 
hypotheses will be described. 
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CHAPTER III 

The Measurement of Variables Described 
Measurement of the Variables for Hypothesis 1 

Hypothesis 1: Children's skill at recovering the deep structure 

of sentences is positively related to reading comprehension skill. 

Independent Variable; Skill at recovering the deep structure of sen t ences . 

This variable is measured by a 25 item test developed by the author 
for this study, The items on this test require Ss to choose the one 
sentence out of three that is not a paraphrase of the other two. Some 
examples follows^ 

1* a. Th'j boy gave the book to the girl, 

b. The book was given the girl by the boy, 

*c, The book was given to the boy by the girl, 

2. *a. What the boy would like is for the girl to leave. 

b. For the boy to leave is what the girl would like, 

c. What the girl would like is for the boy to leave, 

3. a. He painted the red house, 

*b. He painted the house red, 

c. He painted the house that was red. 

*The asterisk Indicates the sentence with a deep structure different 
from the other two sentences, 
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4. a. The girl asked the boy when to leave. 

b* The girl asked the boy when she should leave. 

*c. The girl asked the boy when he should leave. 

5. *a. The girl who the boy hit fell down, 

b. The boy the girl hit fell down. 

c. The boy who the girl hit fell down. 

The complete test is reproduced in Appendix A. This test will 
henceforth be called the Deep Structure Recovery Test (D.S.R.T.). 

A close look at the test reveals the characteristics of the items. 
In each item there are two sentences that have the same deep structure 
and the same meaning. These two sentences have different surface . 
structures due to the application of one or more extra transformations 
to one of them. The third sentence has a surface structure that is 
the same or similar to one of the other two sentences, but it has a 
different deep structure and consequently a different Meaning from 
the other two sentences. An example will clarify this description. 

In the sample Item, 

a. The boy hit the girl. 

b. The girl was hit by the boy. 

*c. The boy was hit by the girl. 

a and b both have the same deep structure but have different surface 
structures in that b has had the passive transformation applied to it. 

C has the same surface structure as b but has a different deep structure 
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than b because in c "girl" is the logical subject while in b and a 
"girl" is the logical object. The same reversal of deep structure 
relationships hold for ’boy," In addition the same content words, 

i.e. nouns i .wl verbs, are used in all three sentences of each item. 
Overall in each item the sentence with the different deep structure 
from the other two sentences is the same in almost all characteristics 
as at least one of the other sentences except that it has a different 
deep structure. 

Validity of the D.S.R.T. The validity of the D.S.R.T., as a 
measure of the ability to recover deep structure, rests on the follow- 
ing assumptions: 

1. Sentences that are paraphrases of each other have the same 
deep structure. 

2. It is necebsary to recover the deep structure in order to 
determine whether or not sentences are paraphrases of one 
another » 

3. Choosing the correct answer on items of the D.S.R.T. indicates 
that the deep structures of at least two of the three sentences 
in each item have been recovered. 

4. The D.S.R.T. ic measuring the recovery of deep structure. 

It is not measuring certain other variables such as word 
recognition and word knowledge that are associated with 
reading comprehension. 
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5. Ths statistical properties of the test are adequate* 

Each of these assumptions will be discussed below. 

Assumption Is Sentences that are paraphrases of each other have the 
same deep structure. 

This is simply a fact of language as it is described by T.G. 

If T.G. is wrong about this fact, then the assumption is false. If 
T.G. is correct, then ’;he assumption is correct. Until T.G. is proven 
to be wrong as a theory of language competence or the theory changes 
so that this assumption no longer is a fact of language, it seems 
reasonable to accept the correctness of assumption 1. 

Assumption 2 : It is necessary to recover the deep structure in order 

to determine whether or not sentences are paraphrases of one another. 

The evidence presented in Chapter I that recovery of deep structure 
is necessary for understanding i sentence plus assumption 1 offer strong 
support for this assumption. The line of reasoning supporting this 
assumption is that understanding 8 sentence involves recovery of the 
deep structure of that sentence. If two sentences have the same deep 
structure l*e., have the same meaning, an S must recover the deep structure 
of both sentences in order to determine if they are paraphrases of each 



Assumption 3: Choosing the correct answers on items of the D.S.R.T. 

indicates that the deep structures of at least two' of the three sentences 
in each iten have been recovered. 
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In order to show the validity of this assumption it is necessary 
to rule out other alternative ways of getting the correct answer or 
ways that do pot require recovering deep structure of the sentences. 

The first alternative is that Ss get the correct answers by random 
guessing. This alternative can he ruled out by looking at the mean 
total scores on the test for the sample studied and comparing this 
score to the score that is possible by random guessing. Since there 
are three choices per item, Ss have 1/3 chance of getting the right 
answer on each item by guessing. There are 25 items on the test. 

This makes 8.33 a chance score on the total test assuming that all 
choices are equally probable r The mean score of the sample studied 
was found to be 18.575. This is sufficiently different from 8.33 to 
rule out random guessing as an alternative. 

A second related alternative is that Ss choose some set position 
i.e. first, second, or third sentence in doing the test. This alternative 
was controlled for by attempting to balance the position of the correct 
answer over all 25 Items. An error in this procedure produced the 
following partially unbalanced distribution of correct answers over 
positions: 

(a) First position -- 8 items 

(b) Second position — 7 Items 

(c) Third position — 10 items 

However, even if Ss adopted an optimal position strategy and 
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chose the third position their scores would be 10 correct. This 
score is sufficiently different from the Ss f mean score of 18.575 
to rule out the alternative that Ss choose the answer by its position. 

A third alternative is that the length of the sentence influences 
the Ss 1 choice. There are a few different possibilities here. Ss 
could adopt a strategy of choosing the shortest sentence in each item 
or the longest sentence without reading the sentences at all. Or 
they could actually read the sentences and choose the longest sentence 
because it is the most complex. Thus the complexity of the sentence 
somehow determines performance on these items rather than skill in 
recovering deep structure. 

A look at the items and their sentence length appears to rule 
cut this alternative. In one item fno.2) all the sentences are the 
same length. In two items (nos. 19, 24) the shortest sentence is 
the correct answer. In two items (nos. 7 r 14) the intermediate 
length sentence is the correct answer. In three items (nos. 1, 13, 

17) the longest sentence is the correct snswer. In five items (nos. 

3, 6, 8* 16, 20) the correct answer is one of two sentences both of 
which are the same length but shorter than the third sentence. And 
finally in twelve items (nos. 4, 5, 9, 10, 11, 12, 15, 18, 21, 22, 23, 
25) the correct answer is one of two sentences of the same length both 
of which are longer than the third sentence. There is no plausible 
strategy that Ss could adopt that would result in a score close to 
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the 18.575 average found. For example an optimal strategy of choosing 
the longest sentence and in the cases where there are two longest 
sentences chose one randomly would only result in 9 or 10 correct. 

It appears that the sentence length alternative can be safely ruled 
out . 

A fourth alternative strategy that Ss might adopt for choosing 
the correct answer without recovering the deep structure of sentences 
in the items is to use some superficial characteristic of sentences. 

For example in item 13 1 

13. a. The boy beg g ed the girl to tell the truth. 

b. What the boy begged the girl to do was tell the truth. 

*c. To tell the truth was what the girl begged the boy to do. 

Ss might adopt a strategy of comparing the phrases underlined here and 
determine that c is the correct answer because the phrase is reversed. 

Or Ss might look at the first occurrence of the nouns "boy" and "girl" 
and choose a sentence in which the first occurrence of "boy" or "girl" 
is different from the other tv/o sentences. Or Ss could use the same 
strategy with the second occurrence of "boy" or "girl," All these 
strategies and other similar strategies do not involve recovery of 
the deep structure of the sentences > in fact they hardly even require 
Ss to read the sentence at all. 
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There are 8 items out of the 25 in whirh the use of superficial 
clues will yield the correct answer. These items, with their super- 
ficial clues underlined, are presented in Table 1 along with their 
item difficulty. 

In the remaining 17 items of the D.S.R.T. (nos# 1, 3, 4, 6, 7, 8, 
9, 10, 11, 12, 14, 16, 19, 20, 21, 23 & 25), there ere no superficial 
clues that Ss could use in choosing the correct answer rather than 
recovering the deep structure of the sentences. Thus the superficial 
clue strategy alternative can be ruled out for these 17 items* These 
items will be referred to as the No Sup. Clues items* 

In order to examine Ss 1 use of the superficial clue strategy in 
the 8 items in Table 1 this strategy is actually possible, it is 
necessary to compare Ss 1 performance on the 8 items which contain 
superficial clues (Sup. Clues) with Ss f performance on the 17 items 
that do not contain superficial clues (No Sup. Clues). A It ok at 
the item difficulties in Table 1 reveals that 6 out of the 8 items 
are quite easy. Most Ss chose the correct answer on these items. 

A t test between the average scores on the Sup. Clues Items and the 
average scores on the No Sup. Clues items reveals that the Sup. Clues 
items were significantly (p<.001) easier than the No Sup. Clues 
Items. This finding would tend to support the conclusion that; Ss 
were using superficial clues on the Sup. Clues items. S 3 go'; more 
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Item l 

2« *a, 

b. 

c, 

5. 

b. 

c. 

13 • a • 
b. 
*c. 

15. a. 
*b, 
c « 

17. a, 
b. 

*c, 

18. *a. 

b. 

c. 

22 . • < 
b. 
*c. 

24. a. 
*b. 
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TABLE 1 



ITEMS WITH SUPERFICIAL CLUES 
(Superficial Clues Underlined) 



N*103 


Item 

Difficulty 


What the boy would like is for the girl to leave. 
For the bov to leave is what the girl would like. 
What the girl would like is for the boy to_leaye T 


.709 


The girl who the boy^hit. fell down. 




The. boy the girl hit fell down. 


.495 


The boy who Che.gjjdLMt fell down. 




The bov beaded the girl to tell the truth. 




What the boy begged the girl to do was tell the truth 


. .922 


To tell the truth wrs what the nirl beeaed the boy 
to do. 




The girl. taught, the boy to use a pencil. 

What the boy taught the girl was to U9e a pencil. 
To use a pencil was what the ^lrlJtaught the boy. 


.903 


The girl ordered the boy to sit down. 

What the airl ordered the bov to do was sit down. 

To sit down was what the boy ordered the fcirl to do. 


.961 


The glyj, who the boy knew vent home. 
The bov the Kiri knew went hone. 

The boy who the girl knew went home . 


.631 


The bov sees that the girl is tall. 

That the girl- is tall ia seen by the boy. 

What the girl sees is tha^the boy_la tall. 

What the girl wants is for the boy to find the ball. 


.825 


The boy wants the girl to find the ball. 


.874 


For the toy_to .find, the ball is what the Rirl wants. 








51 . 



right answers on these Items than on the No Sup. Clues items where 
they had to depend upon the recovery of deep structure exclusively* 

Further evidence bearing on this conclusion can be found in the 
correlations between the Sup. Clues items and the No Sup. Clues items 
and the dependent variable of this study, reading comprehension as 
measured by a Cloze test and the Reading Subtest of the Metropolitan 
Achievement Test. (See discussion of the dependent variable on pp. 57-61 
for the rationale for these measures). This data is presented in 
Table 2. 

TADLE 2 

Correlations between Sup. Clues and No Sup. Clues Items 
and Reading Comprehension (N®87) 





Reading Comprehension 


Variable 




Cloze 


MAT Reading 


Sup. Clues 




.536** 


.393** 


No. Sup. Clues 




.719** 


.441** 



**p < .01 



This data shews that the Sup. Clues items correlate significantly 
with both measures of the dependent variable. These correlations are 
higher for the No Sup. Clues items than for the Sup. Clues items and 
thib difference is significant at the .05 level for the Cloze criterion 
but nonsignificant for the H.A.T. Reading test. 
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The similar correlations between the Sup. Clues and the No 
Sup. Clues items and the criterion measures suggest that the superficial 
clue strategy is used very little. The lowered correlation for the 
Sup. Clues items can be explained by a ceiling effect in which nearly 
ZQ% of the Ss got a perfect score of 8 on the test. This ceiling 
effect would of course tend to lower the correlation. The evidence 
presented thus far is not sufficient to rule out the use of superficial 
clues for the Sup. Clues items. However the effect on the results 
of the study is not important because a complete analysis with the 
No Sup- Clues items omitted produced essentially the same results 
as the analysis which included these items. Thus the Inclusion 
of the 8 Sup. Clues items has no effect on the results or conclusions 
of this study. In this study, then, only the results from the total 
D.S.R.T. will be reported here. 

Assumption 4: The D.S.R.T. is not measuring certain other variables 

associated with reading comprehension in addition to skill at recovering 
deep structure. 

In order to make the D.S.R.T. a clean measure of the ability to 
recover deep structure, it is necessary to make sure that the D.S.R.T. 
is not measuring other variables that are associated with reading 
comprehension. If this is not done and the D.S.R.T. turns out to 
correlate with reading comprehension, then it will be hard to tell 
vnether this correlation is due to skill at recovering deep structure 
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or to some other variable that is being measured by the D.S.R.T. 
which correlates with comprehension. 

One variable which the D.S.R.T. could be measuring and which 
is also related to reading comprehension is word recognition skill. 

It is possible that the Ss are making errors on the D.S.R.T. not 
because they can*t recover deep structure but because they cannot 
recognize the words in the D.S.R.T, This possibility was ruled out 
in two ways: (a) The test was composed of a small set of 78 different 

words from a vocabulary for the primary grades developed for remedial 
work by Durrell (1956). Since the sample for the study was fifth 
grade students, it seems reasonable to assume that the words on the 
D.S.R.T. were easy to recognize, (b) In order to be absolutely certain 
that the Ss for this study are able to recognize the words on the D.S.R.T. 
the following procedure was employed. Ss wore rank ordered on the basis 
of word recognition ability as measured by the Word Discrimination 
Subtest of the Metropolitan Achievement Test. Then beginning with 
the Ss with the lowest scores on Word Discrimination Test, Ss were 
tested individually on their ability to recognize the 78 individual 
words on the D.S.R.T, The Ss were required to pronounce all 78 words. 
Every S with a Word Discrimination score below 5.0 was tested as it was 
assumed that the rest of the Ss who had scores higher than 5.0 on the 
Word Discrimination test would be able to recognize the words. Only 
9 Ss made errors on the 78 words. Seven made only .1 error, 1 made 
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2 errors and 1 made 4 errors. Ss were eliminated if they made mistakes 
on any item of the D.S.R.T. which contained words which they had failed 
to recognize on the oral pronunciation test. Using this criterion, 

7 of the 9 Ss were eliminated from the sample. Therefore the alternative 
that the D.S.R.T. measures the ability to recognize words can be ruled 
out because words were easy to recognize and those Ss with trouble 
were eliminated. 

Another variable which the D.S.R.T. could be measuring is Ss 1 
knowledge of word meanings, Ss* errors on the D.S.R.T. uay be due 
to their lack of knowledge of the meanings of the words on the test 
rather than their ability to recover deep structure. Since word knowledge 
is generally correlated with reading comprehension, a positive correlation 
between the D.S.R.T. and reading comprehension nay be due to Ss 1 knowledge 
of the word meanings on the test rather than their ability to recover 
deep structure. 

There is reason vO believe that the knowledge of the meanings 
of the words on the D.S.R.T. is not a major factor in performance on 
this test for the sample studied. The words used are at primary grade 
level (Durrell, 1956) and are on the list of easy words used in the 
Dale-Chall readability formula. In addition, from a simple examination 
of the words on the test, it appears that they sre quite easy and common. 
Finally, the sample studied were fifth grade students who were on the 
average above grade level on the Word Knowledge subtest of the M.A.T. 
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and who had an average I.Q, of 117* Given this information about 

the difficulty of the words on the test and the ability of the Ss* 

it seems highly unlikely that these Ss do not know the meanings of 
the words on the D.S.R.T. 

Assumption 4 is tenable in that the D.S.R,T. does not appear 
to be measuring word recognition skill or knowledge of word meanings. 

Assumption 5: The statistical properties of the test are adequate. 

On the basis of data collected on 103 fifth grade students 

2 

the following statistical properties of the test were founds 

1. The item difficulties ranged from .282 to .96. 

2. The internal validity as measured by a biserial correlation 

between the test items and the total score showed items with 
internal validities ranging from .358 to .842. 

3. The K’ider-Richardson 20 reliability of the total D.S.R.T. 
is .7964. 

See Appendix B for the Complete item statistics of the test. 
Table 3 presents the basic statistics for the total D.S.R.T, 



this same population was used to test the hypotheses of the study. 
Thus there was no independent statistical validation of the D.S.R.T. 



O 

ERIC 



67 



56 . 



TABLE 3 

Basic Statistics of the Total D.S.R.T. 
(Sum of Items correct) N«103 



Variable 


* 1 


■ Median 


Range 


S.D. 


Skewness 


D.S.R.T. (total. 
25 Items) 


18.575 


2O.C0 


3.25 


4.258 


-0.767 



The following concluctons can be drawr. from this data concernirg 
the statistical properties of the test; 

1. The reliability of .the test is acceptable but slightly low. 

2. The test is relatively easy with an average total score of 
18 « 575 out of 25 items. A number of items were quite easy, with item 
difficulties over .7. 

3. The items for the most pat t are good discriminators with moot 
of the items exhibiting blserlal correlations well over .5. 

Taken together this data makes assumption 5 tenable. The statls- 
tlcal properties of the test are adequate| (Guilford, 1951). 

In summary, all the assumptions upon which the validity of the 
D.S.R.T. rests appear to be sound. The D.S.R.T* by meeting these five 
assumptions is a valid test of skill in recovering the deep structure 
of sentences. Consequently It has construct validity; A relationship 
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between the measurement and the underlying skill, i.e. ability to 
recover deep structure, ha* been established. 

The Dependent Variable: Reading Comprehension Skil l 

As discussed in Cl pter 1 the measurement of reading comprehension 
skill presents a difficult problem because not much is known about the 
reading comprehension process. As a result, in measuring comprehension 
skills it is difficult to identify the relevant behaviors that must be 
measured as well as to distinguish the measurement of comprehension 
skills from other student skills, Until a theory of the comprehension 
process is developed there will be no completely satisfactory solution 
to this problem. It is possible, however, to distinguish better 
measures of comprehension from poorer measures of it even though they 
all lack construct validity, 

. The best measure currently available for the measurement of 
reading comprehension is the Cloze test as discussed in Chapter 1, The 
reasons that the Cloze provides a better measure of comprehension than 
the traditional standardized test are discussed below: 

1. As discussed in Chapter 1 the Cloze test measures fewer 
extraneous aspects of student functioning than traditional 
tests do. 

2, Cloze tests provide a measure of comprehension as this process 
is in progress, not after the process is completed as is the 
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case with traditional tests. In this way the Cloze test is 
a more direct measure of the comprehension process than 
traditional tests. 

3. Reliabilities of Cloze tests (Weaver & Kingston, 1963) compare 
favorably with reliabilities of traditional tests and meet the 
standards for good test construction. For example Weaver 6 
Kingston (1963) found reliabilities in the high .80*8 and 
low ^O's. 

The concurrent validity of Cloze tests ss a measure of comprehension 
has been demonat rated by a number of studies which have shown correlations 
between the Cloze and traditional measures of comprehension ranging from 
.50 to .80 i (See Potter, 1968 for a summary of these studies). In 
addition Cloze tests have been used in readability studies end have 
been shown to be as good or better measures of comprehension than 
traditional teats (Bormuth, 1966; Potter, 1968). 

The evidence on the concurrent validity of Cloze tests indicates 
that they are equally good meast res of reading comprehension as tradition- 
al tests, while the three points presented above indicate that Cloze 
tests are more direct measures of the comprehension process. Since 
this study is concerned with the comprehension process, the Cloze 
test is the major measure of reading comprehension in it. 

The Cloze Tests of Comprehension Used in this Study. Three ISO 
word passages were selected from a set of passages calibrated on 473 
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college students by Miller and Coleman (1967). Th< passages come 
from The McCall Crabbs Standard Test Lessons In Reading (1961). The 
paragraphs are reproduced In Appendix C, The difficulties of the 
passages as determined In the Miller and Coleman study are presented 
in Table 4. 

TABLE 4 



Cloze Passage Calibration 



Scoring Criterion 


Cloze Test 
Passage 1, 


Cloze Test I 
Passage 2, 


Cloze Test 
Passage 3. 


A. % Correct Responses 

(Every 5th word deletion) 


78.9 


68.5 


55.7 


B. X Correct Responses 
(One word per passage 
deletion) 


88.2 


75.3 


66.7 


C. % Correct Responses 

(S guesses every void after 
seeing preceding word) 


42.1 


34.4 


35.0 


D. X New Correct Responses 
(Ss repeated C above) 


38.8 


42.1 


43.0 


E. Total number of Correct 
Response' on A.B«C. 
(2400 possible correct 
responses) 


1389 


1215 


1107 


F. McCall Crebbs grade level 
score for 90% comprehension 


7.0 


9.0 


6.6 


C. McCall Crabbs grade level 
score for 70% comprehension 


6.7 


6.0 


6.0 
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All three passages were prepared with every fifth word deleted 
i.e., 30 deletions per passage. Passage* 1 and 2 were administered 
in the standard way, Ss were simply instructed to fill in the blanks. 
For Passage 3 a slightly different procedure was used. Ss first 
read the intact passage. Then without the intact passage to refer 
to they were given the same passage as a Cloze test with every 5th 
word deleted. There vere no time limits on the test. For each test 
the number of exact correct replacements was totaled. Table 5 presents 
the results of the test on 6? 5th grade students plus the Rulon split 
half reliability (Guilford, 1951). All three tests have high relia- 
bilities. 

TABLE 5 

Basic Statistics on Three Cloze Tests (N«87) 



Variable 


No. 

Items 


X 


Median 


S.D. 


Range 


Skewness 


Rulon 

Reliability 


Cloze Tes 4 ^1 


30 


17.724 


18.00 


3.646 


5-25 


-0.701 


.884 


Cloze Test #2 


30 


16.138 


16.00 


4.243 


5-27 


-0.024 


.924 


Cloze Test #3 


30 


16.023 


16.00 


3.782 


4-24 


-0.347 


.893 



The three Cloze tests were converted to T scores and cammed to 
form a composite Cloze measure of reading comprehension. This measure 
was used as the dependent variable of the study. Table 6 presents the 
Intercorrelations of the three Cloze tests plus the composite test. 
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TABLE 6 

Intercorrelations: Cloze Tests* (N**87) 



Variable 


1 


1 


3 


4 


X 


S.D. 


1. Cloze #1 


1.000 


.497 


.491 


.803 


17.724 


3.646 




2. Cloze 1/2 




1.000 


.563 


.834 


16.138 


4.243 




3. Cloze i/3 






l.oon 


.832 


16.023 


3.782 




4, Composite 
Cloze 
(T score) 










150.000 


24.712 





*A11 correlations are significant at the .01 level or less. 



In addition to the Cloze test described above a traditional 
measure of reading comprehension was used. This test is the Reading 
subtest of the Metropolitan Achievement Test (M.A.T.). This test 
has all the Inadequacies of traditional comprehension tests as described 
in Chapter 1. This test is only included to provide a basis of compari- 
son with a traditional comprehension measure. The data on the M.A.T. 
Reading is presented in Table 7. 



TABLE 7 

Basic Statistics on M.A.T. Reading a 



Variable 


X 


Median 


S.D. 


Range 


Skevness 


M.A.T. Reading 


4.984 

(5.4) 

.. 


5.2 

(5.7) 


1.112 


2. 2-6. 5 
(2.8- 8.4) 


-0.605 



a Locai norms (National norms in parentheses) 
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Control Variables 

In testing a hypothesis in which a relationship between an 
independent variable and a dependent variable is hypothesized, it 
is necessary to rule out alternative explanations of the findings 
by controlling other variables that may be related to both the inde- 
pendent and dependent variable and thus account for any relationship 
found between the independent and dependent variable. For the hypo- 
thesis 1 of this study a finding of a positive relationship between 
the D.S.R.T. and the Cloze test may be due to some other third variable 
rather than the hypothesized variables. These third variables must 
be controlled and the relationship of D.S.R.T. with the Cloze test 
must be looked at with the eifect of these variables removed. The 
three control variables for this study are: 

1. Knowledge of word meanings as measured by the Word Knowledge 
subtest of the M.A,T, Elementary Battery Form B, In this 
test the word to be defined is presented in a brief sentence 
with one word underlined. The student is required to select 
from four choices the word which best completes the sentence. 
The word is usually a synonym for the underlined word, 

2* Word recognition skill as measured by the Word Discrimination 
subtest of the M.A.T, Elementary Battery Form B. This test 
consists of sentences with a word missing. The student selects 
the proper word co complete the sentence from among several 
words differing in conf igurstion and meaning. 
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3. Intelligence (I>Q t ) a9 measured by the Lorge-Thorndike 
Intelligence Tests, Form A, Level 3, Verbal Battery. Part 
1 requires a student to read a sentence with a word deleted 
and choose from among five alternatives the word that best 
completes the sentence. Part 2 requires the student to choose 
a word that is of the same category **.s a set of stimulus words. 

Part 3 requires the student to solve arithmetic word problems 
and select the correct answer. Part 4 requires the student 
to choose a synonym for a stimulus word from among five 
alternatives . 

These three tests are typical of traditional ways of measuring 
word recognition, vocabulary and intelligence. In order to be useful 
variables, they should provide clean measures of what they purport 
to measure and nothing else. Unfortunately they appear to suffer 
as do comprehension tests from a lack of construct validity. Because 
of this lack there is an inability to choose the relevant behaviors 
to measure and these tests end up measuring more than their putative 
variables. Ihey seem be measuring some common skills# In the 
first place they all require the student to recognize the words on 
the test as a prerequisite to successful performance on the items. Thus 
to some unknown degree they all are measuring students' ability to recognize 
words* In the second place the Word Knowledge, Word Discrimination 
and the I.Q. tests are all measuring word meanings. Thirdly all three 
tests contain items that are quite similar to a Cloze test in the 
sense that deleted words from sentences must be Identified. Thus 
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they are measures of comprehension as well. Thus on the face of it 
they have much in common with each other as well as with the dependent 
variable of this study, the Cloze test. This interdependence car- 
te seen in the intercorrelations of these variables as presented 
in Table 8. 

TABLE 8 

Intercorrelations i Dependent and Control Variables* (N«87) 



Variable 


"l 


2 


3 


4 


5 


X 


S.D. 


1. Word Discrimination 


1.000 


*761 


.585 


.663 


.542 


4.714 


0.886 


2, Word Knowledge 




1,000 


,674 


,740 


.621 


4.560 


1.005 


3. I.Q. 






1,000 


,587 


.484 


117.011 


12.974 


4. M.A.T. Reading 








1,000 


.528 


4.984 


1.112 


5. Cloze Test 










1.000 


150.000 


24.712 



*A11 correlations significant at the .01 level or less. 



The global nature of the control variables plus their similarity 
to the Cloze comprehension test should tend to make it more difficult 
to show a relationship between the D.S.R.T. and the Clcte test when 
these variables are controlled. Because they measure so much, these 
variables might be expected to account for much of the relationship 
between D.S.R.T. and Cloze comprehension if indeed a positive relation- 
ship is found* 

The same type of argument can be raised in terns of the control 
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variables as they relate to the traditional measure of comprehension 
i.e. , the Reading subtest of the M.A.T. First the M.A.T. Reading sub- 
test is measuring ability to recognize words since an S must be able 
to recognize the words before he can do the test. Second, 6 of the 
44 items on the M.A.T. Reading test ask for the meanings of specific 
words. Control variables which measure word meanings i.e., Word Know- 
ledge and I.Q. will have inflated correlations with the M.A.T. Reading 
because they are measuring some of the same things* Furthermore, 
since they require Ss to read sentences to do the test, all three 
tests ai€i measuring the comprehension of sentences. Thus the three 
control variables have a lot in comracn with the M.A.T. Reading as 
well as with each other. 

To summarize, the control variables d* not appear to be very 
clean measures of the variables they purport to measure. Thev have 
much in common with each other as well as with the Cloze and the M.A.T. 
Reading comprehension tests. 

Measurement of the Variables for Hypothesis 2 

Hypothesis 2: Children's skill in making a lexical analysis of 

the main verb of sentences is positively related to their reading 
comprehension skill. 

Independent Variable: Skill at making a l exical analysis of the main 

verb of sentenc es. 

This variable will be measured in two ways: first by a sentence 
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completion test, second by looking at Ss' performance on two subsets 
of paired items on the D.S.R.T. 

The Sentence Completion Test consists of the beginning of 
six sentences up to and including the verb, Ss are asked to complete 
these sentences with as many words as possible. 

The Sentence Completion Test 

1. The boy liked 

2. The girl expected 

3. The boy loved 

4. The girl remembered 

3. The boy helpe d 

6. The girl believed 

The verbs in these sentences can take either a transitive construction 
e.g., The boy liked the girl or a complement construction e.*;., The 
boy liked to go swimming. If it is assumed that complement structures 
are more complex (See Fodor, Garistt & Bever, 196S) and that Ss with 
more skill at making a lexical analysis of the verb will produce more 
complement sentences than Ss with less skill, then the number of 
complement sentences produced will constitute a measure of skill at 
making a lexical analysis of the verbs ol sentences, The measure 
employed for this variable is then the number of complement sentences 
produced in the sentence completion test. 

The second measure of Ss' ..kill at making a lexical analysis of 
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verbs consisted of ten items from the D.S.R.T. Two types of items 
were used. One type consisted of three paired items with the 
same sentence structure. One item of each pair had a transitive 
verb which can take only one deep structure and the other item had 
a complement verb which can take either a transitive or complement 
deep structure. Table 9 presents these items. 



TABLE 9 



Subset of D.S.R.T. Items: Transitive Verbs - Complement Verbs 



Transitive 



14. 



(bit) 



a. The fat girl bit the thin boy. 

b. The girl who is thin bit the boy who is fat. 
c« The boy who is thin was bitten by the girl 

who is fat. 



Complement 

(loved) 



7. a. The fat girl loved the thin boy. 

b. The girl who is thin loved the boy who is fat. 

c. The boy who ia thin was loved by the girl who 
is fat. 



Transitive 

(hit) 



Complement 

(knew) 



5. a. The girl who the boy hit fell down. 

b. The boy the girl hit f ell down. 

c. The boy who the girl hit fell down. 



18. a. The girl who the boy knew went home. 

b. The boy the girl knew went home. 

c. The boy who the girl knew went home. 



Transitive 

(kicked) 



11. a. The tall boy kicked the short girl. 

b. The boy who is tall kicked the girl who is short. 

c. The tall boy was kicked by the girl who is short. 



Complement 

(liked) 



23. 



a. The tall boy wa^ liked by the girl who was short. 

b. The tall boy liked the short girl. 

c. The boy who is tall liked the girl who is short. 
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The second type of item consisted of 2 sets of paired items 
containing the complement verbs "tell" and "ask," in the same sentence 
structures. The verb "ask" is more complex than "tell" because it can 
take more deep structures. (Carol Chomsky, 1968), Table 10 contains 
these items. 

TABLE 10 

Subset of D.S.R.T. Items: Tell vs. Ask 



Tell //I 



Ask //I 



Tell n 



Ask #2 
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a. The boy told the girl when to leave. 

b. The boy told the girl when he should leave. 

c. The boy told the girl when she should leave. 

a. The girl aske d the boy when to leave. 

b. The girl aske d the boy when she should leave. 

c. The girl aske d the boy when he should leave. 

a. The boy told the girl what to do. 

b. The boy told the girl what he should do. 

c. The boy told the girl what she should do. 

a. The boy aske d the girl what she should do. 

b. The boy aske d the girl what to do. 

c. The boy aske d the girl what he should do. 



If it is assumed that Ss wii:h less skill at making a lexical 



analysis of verbs will tend to make more errors on items with complex 
verbs than on items with simpler verbs, then differences in performance 
on the paired items will constitute a measure of skill at making a 
lexical analysis of main verbs of sen*.ences. 

Dependent Variable: Reading Comprehension Skill . 

This variable was measured in the same way as for hypothesis i. 
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CHAPTER IV 

Description of the Sample and Testing Procedures 
Description of the Sample 

The original sample for the study consisted of 110 5th grade 
students in five classrooms in two schools in a suburban community 
near Boston. When those Ss were eliminated who had missing data and 
who had word recognition problems, the sample size was reduced to 87. 
Table 11 presents the basic statistics on this sample. 



TABLE 11 

Basic Statistics of the Sample (N»87) 



Variable 


X 


Median 


S.D. 


Range 


Skewness 


Age (mos.) 


128.034 


128.00 


4.392 


112-140 


-0.235 


I.Q. 


117.011 


114.00 


12.974 


101-150 


0.988 


Cloze Test 
(T score) 


150.000 


150.85 


24.712 


87-205 


-0.250 


M.A.T. 3 












Word Knowledge 


4.566 


4.3 


1.005 


2. 4-6. 5 


0.240 




(6.4) 


(6.1) 








Word Discrim. 


4.714 


4.7 


0.886 


2.3-5. 7 


-0.722 




(5.5) 


(5.5) 








Reading 


4.984 


5.2 


1.112 


2. 2-6. 5 


-0.605 




(5.4) 


(5.8) 




(2. 7-8. 3) 





a 

The M.A.T. scores are reported in local norms. National norms are 
presented in parentheses . The expected grade level on national norms 
is 5-2 since the Ss were tested in the 2nd month of the fifth grade. 
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The sample is above average in I.Q., above grade level in Word 
Knowledge, Word Discrimination and Reading according to the national 
norms* 

The sample consisted of 42 males and 45 females. The basic 
statistics by sex on the relevant variables plus u t u tests for 
differences between sexes are presented in Table 12. 



TABLE 12 

Basic Statistics of Sample by Sex 





Kales 


(N-42) 


Females (N-45) 


Variable 


X 


S r D • 


X 


S.D. 


t 


Prob* 


Age (mos.) 


128.333 


4*812 


127.756 


3.995 


*611 


0.543 


I.Q* 


117.881 


14*407 


116.200 


11.585 


*602 


0.549 


Cloze Test 


48.371 


10.331 


51.111 


8.585 


-1.233 


0.221 


M.A'T. 














Word Knowledge 


4.388 

(6.4)* 


1.061 


4.544 

(6.3)* 


0.961 


*201 


l.COO 


Word Discrimination 


4.683 

(5.5)* 


0*877 


4.742 

<5.5) a 


0.903 


.308 


1.000 


Reading 


4.993 

(5.4) a 


1.199 


4.976 

(5.4)* 


1.037 


• 

o 

■vj 

to 


1.000 



^National norms 



As can be seen from Table 12 there were no significant sex differences 
on these variable* 
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Testing Procedures 

Ss were tested in classroom groups whose size ranged from 15 to 
32* Thare were two testing sessions which lasted approximately 45 
minutes each. The tests were given in the following order: Sentence 

Completion, Cloze 0i t Cloze i/2, Cloze 03 and D.S.R.T. A brief version 
of the directions for these tests is included here. 

1* Sentence Completion Test 

Ss were fold to finish the sentence with as many words 
as they could. There was no time limit on this test. 

2. Cloze Tests 01 and 0 2 

Ss were told to fill in the missing word after going 
over some samples. There was no time limit set on these tests. 

3 . Cloze Test # 3 

Ss were given the intact p.ragrt^h and told to read it 
silently. They were also told that they would be doing a 
Cloze test on the paragraph. Aftet 2 minutes they were to id 
to 8 top and the paragraphs were collected. Then Cloze Test 
#3 was given out and the students followed the same procedure 
for Cloze Tests 01 and #2. 

4. The D.S.R.T. 

The sample items were completed by the whole class and 
then discussed to make sure the Ss understood the directions. 
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Ss were told to do the rest of the items as fast as they could 
while still being accurate. In addition they were told not to 
go back and change answers in order to get their first response. 

After five minutes Ss were told to stop and circle the item that they 
had just completed. Then they were allowed to finish the test with 
r.o time limit. 
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CHAPTER V 



Results and Analysis 



Hypothesis 1 



Hypothesis Is Children’s skill at recovering the deep structure of 
sentences is positively related to reading comprehension skill. 

This hypothesis was tested by a correlational and regression 
analysis. First the simple zero order correlations between the D.S.R.T. 
and the dependent variable, Reading Comprehension, were calculated. 
Second the relationship between the D.S.R.T. and Reading Comprehension 
was examined with other variables that could explain the relationship 
controlled through the use of partial correlations, Third, a 
regression analysis was performed to determine the relative contribution 
of the independent variable and control variables in accounting for 
the dependent variable. Finally the complete analysis was repeated 
for the male and female subgroups to examine sex differences, 

Zero Order Correlations 

Hypothesis 1 predicts significant and positive correlations 
between the D.S.R.T. and Reading Comprehension. Table 13 presents 
these correlations. 



TABLE 13 



Correlations between D.S.R.T. and Reading Comprehension (N=»87) 



Variable 



Cloze 



M.A.T. Reading 



D.S.R.T 



. 732 ** 



.476** 
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Clearly the results conform to the prediction* 

The correlations between the D.S.R.T. and the Cloze Test are 
significant and quite large, with more than 50% of the variance accounted 
for by the D.S.R.T. The relationship between the D.S.R.T. and the 
M.A.T. Reading is significant but not as great as for the Cloze 
Test* For the M.A.T. Reading approximately 23% of the variance is 
accounted for by the D.S.R.T. This lower correlation is to be 
expected because as has been argued in Chapter IV this traditional 
type of comprehension test measures much more than reading comprehension. 
Thus a variable like the D.S.R.T, would not be expected to exhibit 
as high a degree of relationship to a traditional test as to a more 
direct measure of comprehension such as the Cloze test* Furthermore, 
it must be reiterated here that the Cloze test Is the measure of the 
dependent variable for this study for the reasons discussed in Chapter 
IV, The M.A.T. Reading is only included to show the results on a 
traditional comprehension test but not as a test of the hypothesis. 

When the6t correlations are attenuated to correct for the unreli- 
ability of the tests, the correlations increase to .864 for the Cloze 
test and to .566 for the M.A.T*. Reading. Thus, if the tests were 
perfectly reliable the D.S.R.T, would account for 74% of the variance 
ia the Cloze test and 32% in the M.A.T. Reading. 

In general, reliabilities tend to be so high to begin with that 
It is difficult to raise them very much. Consequently attenuated 
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correlations aren t usually possible to attain in reality. However, 
the relatively low reliability of the D.S^R.T., .796, indicates that 
its reliability can probably be improved through adding more items. 
Thus it is realistic to assume that correlations approaching the 
attenuated correlations are attainable. The analysis presented 
here will employ the unattenuated correlations. Thus, the findings 
will tend to be conservative and may underestimate the true relation- 
ship between the D.S.R.T. and reading comprehension. 

In summary, the zero order correlations offer strong support 
for hypothesis 1. 

Partial Correlations 

The zero order correlations do not tell the whole story. A 
positive 2 ero order correlation between two variables may be complete 
ly explainable by a third variable with which both variables are 
correlated. Thus the original correlation may be completely spurious 
and the third variable may be the important one. To examine this 
possibility, partial correlations were run between the D.S.R.T. and 
the Cloze test and the M.A.T. Reading with each of the three control 
variables partial led out* Table 14 presents the Intercorrelations 
of all the variables used in testing hypothesis 1. Table 15 presents 
the partial correlations. 
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TABLE 14 



Intercorrelations of all Variables Used in Testing Hypothesis 1* (N-87) 




TABLE 15 



Partial Correlations: D.S.R.T. and Reading Comprehension with I.Q., 

Word Discrimination (W.D.), Word Knowledge (W.K.) Controlled 



Variable 


0 order r 


Partial Correlations 






I.Q. 


W,D. 


W.K, 


D.S.R.T. vs « 










Close Test 


.732** 


,658** 


.657** 


.597** 


M.A.T. Reading 


.476** 


.280** 


.290** 


.133 



**p <.01 
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The partial correlations show that the control variables do not 
explain much of the relationship between the D.S.R.T. and the Cloze 
test of comprehension. Very little of the relationship between the 
D.S.R.T. and the Cloze test can be accounted for by I.Q., Word 
Discriminate % and Word Knowledge. This data provides further strong 
confirmation for hypothesis 1, 

The partial correlations for the M.A.T. Reading show that the 
control variables explain much of the relationship between the D.S.R.T. 
and the M.A.T. Reading. This is to be expected since these variables 
have much in common with the M.A.T. Reading test. 

Regression Analysis 

It is important to inquire into the relative importance of the 
D.S.R.T. and the control variables in explaining reading comprehension. 

A regression analysis can be employed to conduct this inquiry. A 
short discussion of this type of analysis follows. 

A regression analysis attempts to find the optimal weighted 
combination of a number of Independent variables in predicting a dependent 
variable. The degree of predictability that the Independent variables 
provide can be looked at in terms of the amount of variance in the 
dependent variable that is accounted for by the Independent variables. *' 
Thus the total variance in regression analysis can be conceived of as 
consisting of two parts — the explained variance and the unexplained 
variance. The explained variance can be further broken down into two 
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parts — the unique variance and the common variance. These four 
components are described below* 

Explained Variance This is the amount of variance that 
is accounted for by all the independent variables;* It is 
the sura of the unique and common variance* 

The Unique Variance This is the amount of variance 
that a given variable does not share with the other 
variables. It is specific to a particular variable. 

The Common Varianc e Thi3 is the variance that is 
shared by all the independent variables. It is what 
is common to all of them. 

Unexplained Variance This is the amount of variance 
that cannot be predicted by the variables. It Includes 
error variance plus variance that is due to other variables 
not treasured in the regression analysis. 

These components of variance add up to 100? which is the 

2 

total variance. The variance explained is simply the multiple R with all 

predictor variables entered into the regression equation. The unexplained 

2 

variance is the multiple R subtracted from 100* 

The relative importance of a variable can be looked at In 
two ways* First the unique variance attributable to that variable can 
be looked at. This figure provides an estimate of the unique importance of 
a variable i.e., how much variance it can explain that cannot be explained 
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by the other variables. The unique variance is found by entering all 

the variables except a specified variable into the regression first. 

2 

Then the increase in R found when the specified variable is entered 
into the regression gives the unique variance of that variable. A 
second estimate of the relative importance of a variable can be obtained 
by adding to the unique variance of that variable a share of the common 
variance that is in proportion to its zero order correlation with 
the dependent variable. This estimate is obtained by multiplying 
the beta weight of each variable by its zero order correlation with 
the dependent variable. 

Table 16 presents the results of the regression analysis with 
the Cloze test as the dependent variable. 

TABLE 16 



Regression Analysis: D.S.R.T., Word Knowledge, Word 
Discrimination, I.Q. with Cloze Test (N»87) 



Variable 


(1) 

O-order 

r 


(2) 

Beta 

Weight 


% Unique + 
proportion 
Common 
(1) x (2) 


X 

Unique 

Variance 


*x 

50 C 

1 


X 

Vari ice 
Explained 


D.S.R.T. 


.732 


.554; 


40.6 


21.3 


.785 


61.6 


Word Knowledge 


.621 


.2118 


13.2 


01.4 






Word Discrimination 


.542 


.1351 


07.3 


00.8 






I.Q. 


.484 


i .0097 


00.5 


00.0 












Total 

Unique 


23.5 


X Unexplained 
Variance , 








Common 

Variance 


38.1 







O 



ERIC 



91 



1 



80. 

Table 16 shows that the D.S.R.T. Is the only variable in the 
regression that has any unique variance worth discussing. And the 
amount of unique variance (21,3%) is very substantial. The analysis 
or the unique plus common variance shows that the D.S.R.T. accounts 
for 40.6% of the explained variance which is quite substantial. This 
percentage is three times as great as the next highest variable which 
contributes 13.2% and over 5 times as great as the next highest variable 
Word Discrimination (7.3%). I.Q. makes almost no contribution at all. 

The results of the regression analysis demonstrate tftat the 
D.S.R.T. is, by a substantial amount, the most important factor in 
reading comprehension as measured by the Cloze test when compared to 
the other variables in the analysis. 

As expected when the M.A.T. Reading is the dependent variable 
the situation changes as Table 17 shows. 



TABLE 17 

Regression Analysis: D.S.R.T.., Word Discrimination, 

Word Knowledge, I.Q. with M.A.T. Reading (N=87) 



Variable 


(1) 

0-order 

r 


(2) 

Beta 

Weight 


% Unique + 
proportion 
Common 
(1) x (2) 


% 

Unique 

Variance 


Mult 

R 


% 

Variance 

Explained 


D.S.R.T. 


.476 


.0851 


04.1 


0.5 


.765 


58.6 


Word Knowledge 


.740 


.4525 


33.5 


6.4 






Word Discrimination 


.663 


.2121 


14.1 


1.9 






I.Q. 


.587 


.1137 


06.7 


0.7 












Total 

Unique 


9.5 


% Unexplained 
Variance 41.4 


ERjt 






Common 

Variance 


49.1 





31 . 



With the M.A.T. Reading as the dependent variable, Word Knowledge 
accounts for the highest amount of unique variance (6.4%). However 
this amount is quite small. The other variables account for hardly 
any unique variance. In fact all the variables taken together only 
account for 9.5% of the unique variance, leaving a common variance 
of 49.1%. This means that almost 5/6ths of the variance explained is 
common variance i.e., variance common to all the tests. This adds 
support to the suggestion in Chapter IV that the Word Knowledge, 

Word Meaning and I.Q. tests are measuring the same thing. The dependent 
variable is so global as are three of the independent variables that 
it is not surprising to find a lot of common variance. The unique 
plus common variance distribution shows that Word Knowledge is (33.5%), 
slightly more than twice as important as Word Discrimination (14%), which 
in turn is nearly 3 times as important as either I.Q. or the D.S.K.T. 

Thus Word Knowledge seems to be the most important factor in the M.A.T. 
Reading . 

To summarize, the evidence from the zero order and partial 
correlations overwhelmingly confirms hypothesis 1. In addition the 
regression analysis suggests that the D.S.R.T. is a very important factor 
in comprehension, a factor not accounted for by other variables such 
as Word Knowledge skill, Word Recognition skill, and I.Q. The regression 
analysis for the total group provides strong evidence that the recovery 
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of deep structure is an important aspect of reading comprehension. 
Sex Subgroup Analysis 

In order to see if hypothesis 1 had differential effects, the 
total sa le was broken into a male and a female subgroup and the 
analysis was repeated for each subgroup. As can be ceen from Table 
18 the only eex differences in mean scores on the variables was on 
the D.S.R.T. On all other variables there are no differences. 



TABLE 18 

Comparison of Keans for Males and Females 



Variable 


Males (N«=42) 


Females <N»45) 




Prob. 


X 


S.U. 


X 


S.D. 


t (two-tailed) 


D.S . R« T . 


17.452 


4.278 


19.622 


4.000 


2.44 


.01 


Word Knowledge 


4.588 


1.061 


4.544 


.961 


0.201 


N.S. 


I.Q. 


117.881 


14.407 


116.200 


11. 585 


0.600 


H.S. 


Word Discrimination 


4.683 


0.877 


4.742 


.903 


0.300 


N.S. 


M.A.T. Reading 


4.922 


1.199 


4.976 


1.037 


0.070 


N.S. 


Cloze Test 


48.571 


10.331 


51.111 


8.858 


1.233 


N.S. 



Note. - N.S. * nonsignificant 
Zero Order Correlations ; 



Table 19 presents the zero order correlations between the D.S.R.T. 
and Reading Comprehension by eex: 
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TABLE 19 



Correlations between D.S.R.T and Comprehension by Sex 





Cloze 


M.A. 


T. Reading 


Variable 


Hale 


Female 


Male 


Female 


D.S.R.T. 


,768** 


.682** 


.682** 


.279 



** p < .01. 



These correlations support hypothesis 1 for both males •' \d 
females because of the significant and substantial correlation between 
the D.S.R.T. and the Cloze Test. 38.9% of the variance is accounted 
for by the D.S.R.T. for males and 46.5% for females, The difference 
between the correlations for males and females is not statistically 
significant. 

The attenuated correlations are .864 for males and .805 for 
females. These correlations indicate that if the tests were perfectly 
reliable 74% of the variance in the Cloze test could be explained for 
icales and 64% for females. On the M.A.T. Reading the D.S.R.T. is 
significantly correlated for males but not for females. 

Partial Correlatio ns The partial correlations by sex are 




presented in Table 20. 
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TABLE 20 

Partial Correlations: D.S.R.T. and Reading Comprehension with I.Q., 

Word Discrimination (W.D.), Word Knowledge (W.K.) controlled. 

Analysis by eex. 



Variable 


0-order r 




Partial 


Correlations 






D.S.R.T. vs . 






I.Q. 


W.D. 


W.K. 






















M 


F 


M 


F 


M 


F 


M 


F 


Cloze 


.788** 


.682** 


.666** 


.569** 


.569** 


.662** 


.510** 


.573** 


M.A.T. Rdg. 


.682** 


.279 


. 438** 


.020 


.385* 


.186 


.213 


-.108 



*p< .05 **p<.01 



The partial correlations between D.S.R.T. and the Cloze test 
of reading comprehension with I.Q., Word Discrimination, and Word 
Knowledge separately controlled show significant correlations for 
both males and females. Taken individually the control variables do 
not explain the relationship between the D.S.R.T. and the Cloze test 
of comprehension. Thus for both male and female subgroups hypothesis 
1 is confirmed. 

On the M.A.T. Reading the partial correlations are not significant 
for females but are significant for rales with I.Q. and Word Discrimin- 
ation controlled. With Word Knowledge controlled the partial correlation 
becomes lion significant for males also. 
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Regression Analysis The intercorrelations of all the variables used 
in testing hypothesis 1 are presented by sex in Table 21 (p.86). These 
correlations show the same general pattern as for the total group. 

The correlations for the males appeat to be generally higher than for 
the females. 

The results of the regression snalysis with the Cloze comprehension 
test as the dependent variable are presented in Table 22 (p.87). 

Only the D.S.R.T. contributes any substantial degree of unique 
variance. The unique variance contributed by the females is nearly 10% 
more than for the males. As for the common plus unique variance the 
D.S.R.T. accounts for nearly 2/3rds of the total variance accounted 
for in both male and female subgroups. This is nearly twice as much 
as the next highest variable. Next to the D.S.R.T. > Word Knowledge 
neeos to be more important for f'aualcc and Word Discrimination for 
males. The results of this analysis by sex is more or less the same 
as for the total group. For both males and females the D.S.R.T. appears 
to be a very important unique aspect of reading comprehension as 
measured by the Cloze test. 

The results of the regression analysis for sex subgroups with 
the M.A.T. Reading as the dependent variable are presented in Table 23. 
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Inter correlations of All Variables Used in Testing Hypothesis 1, by Sex: 

Kales (N"42) Females (N~A5) 
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wions over .288 ere significant at the . o5 level. Correlations over ,338 are significant 
,01 level. 



Regression Analysic by Sex: D.S.R.T., Word Knowledge, 
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For males very little unique variance is accounted for by any 
of the variables. When unique and common variance are combined, the 
D.S.R.T. and Word Discrimination are the two most important variables* 

For females Word Knowledge is the most important variable accounting 
for 20,3% of the unique and 53.8% of the unique plus common variance. 

The other variables don’t contribute very much if at all. For males 
the D.S.R.T* and Word Discrimination are important and for females 
Word Knowledge is the only important variable. The D.S.R.T., as a 
factor in reading comprehension, as measured by the M.A.T. Reading, 
is important for males and of little importance for females. 

Hypothesis 1; Conclusions 

The results of the analysis of hypothesis 1 yield the following 
conclusions . 

1. Hypothesis 1 is supported by the results. The results of the 
analysis show that Ss’ ability to recover deep structure is related to 
reading comprehension. 

2. Recovering the deep structure is an important aspect of 
reading comprehension. In fact Ss’ skill at recovering the deep structure 
of sentences is a much more important aspect of reading comprehension skill 
as measured hy a Cloze test, than I.Q., word knowledge and word recognition 
skill. Furthermore only Ss‘ ability to recover deep structure makes any 
substantial unique contribution to reading comprehension skill. 
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3, Word knowledge is a more important aspect of reading 



comprehension , as measured by a traditional comprehension test, than 
word recognition, I.Q. and skill at recovering deep structure. However, 
none of these factors makes very much unique contribution to reading 
comprehension as measured by a traditional comprehension test. 

4. Females are superior to males in skill at recovering the 
deep structure of sentences, 

5. There are no sex differences in the importance of skill at 
recovering deep structure as an aspect of reading comprehension when 
reading comprehension is measured by the Cloze test. 

6. For both male and female subgroups, Ss* skill at recovering 
the deep structure of sentences is a much more important aspect of 
reading comprehension, as measured by the Cloze test, than I.Q., word 
knowledge and word recognition skill. 



7. For males, word recognition skill and skill at recovering the 



deep structure are both more important aspects of reading comprehension, 
as measured by the M.A.T. Reading test, than word knowledge and I.Q. 
However none of these variables makes very much unique contribution to 
rending comprehension skill. 



8. For females word knowledge is a much more important aspect of 



reading comprehension, as measured by the M.A.T. Reading test, than I.Q., 
word recognition skill and skill at recovering the deep structure. Further- 
ttore for females word knowledge makes the only substantial unique contribution 

O 
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to reading comprehension. 

9. The findings for the Cloze criterion appear to hold up 
under replication. The testing took place in two different schools, 
and the testing in the second school constitutes a replication of the 
testing in the first school. The analysis was repeated for each school 
separately. The results as reported in Appendix D for both schools 
support hypothesis 1, 

Hypothesis I t D iscussion 

The differences in the results for hypothesis 1 between the 
Cloze test criterion and the M.A.T. Reading test criterion deserves 
further comment. For the total group Ss 1 skill in recovering deep 
structure appears to be an important aspect of reading comprehension 
when reading comprehension is measured by the Cloze test. When reading 
comprehension is measured by the M.A.T. Reading test, Ss 1 ability to 
recover deep structure, although related le reading comprehension, does 
not appear to be an important aspect of it as compared to I.Q., word 
discrimination and word knowledge skill. The two ways of measuring 
comprehension then produce different results. 

The difference in results can be explained by problems in the 
measurement of variables. As discussed in Chapter III, the I.Q. test, 
the Word Discrimination test and the M.A.T. Reading test all seem to be 
measuring the same thirg, thus they seem to have a great deal of common 
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variance. This is exactly what Table 17 shows — a great deal of 
common variance and very little unique variance. This shared variance 
appears to be due in part to the measurement procedure, i.e. standardized 
multiple choice tests, rather than to the intrinsic relationship of the 
variables. v * ' 

Furthermore the fact that the M.A.T. Reading test, Word Knowledge 
te9t and Word Discrimination test are subtests of the same standardized 
test guarantees that they will be correlated with each other. All this 
plus the global nature of the variables insures a great deal of common 
variance. It is .thus difficult for a relatively clean cut measure like 
the D.S.R.T. to account for any unique variance in the M.A.T. Reading 
test. The results gathered from the M.A.T. Reading test criterion are 
difficult to interpret because of these measurement problems^ Therefore, 
the use of the Cloze test criterion constitutes the only valid test of 
hypothesis 1. 

The analysis of the sex subgroups also deserves comment. The 
only differences between the mean scores for males and females is on 
the D.S.R.T. where females are superior to the males. On all other 
variables there is no statistical significance between males and females. 

The superiority of the females over the males in ability to recover the 
deep structure of sentences may be due. to the females 1 greater skill or 
to some extraneous aspect of the test or testing situation. This difference, 
if it were upheld under replication, would deserve further study. 
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Another sex difference is in the relationship between the D.S.R.T. 
and the M.A.T. Reading test. For males the D.S.R.T. is related to the 
M.A.T. Reading test, but for females there is no relationship. This 
difference is not easy to explain. 

At first glance then it appears that the ability to recover deep 
structure is an important aspect of reading comprehension for males but 
not for females. This might be explained through the following speculation. 
The ability to recover deep structure is a lower level skill than some 
of the higher level skills that traditional reading tests claim they are 
measuring. Since males are usually lower than females in reading ability 
at the beginning stages of reading, it is possible that the males are 
at a lower stage of reading development than the females. And at this 
lower stage, where the males are, the ability to recover the deep structure 
is important, but at the higher stages, where the girls are, it is not 
important* This would explain the significant correlation for males and 
the lack of significant correlation for females between the D.S.R.T. and 
the M.A.T* Reading test. 

This speculation, however, is not borne out by the data. The 
notion that the females are on a higher level and that the males are on 
a lower level does not seem to hold for the sample studied here. First, 
there are no mean differences between the males and females on all the 
variables except the D.S.R.T. Thus the data does not indicate that 
females generally are at a higher level than males. Second, there are 
no sex differences in the strength of the relationship between the D.S.R.T. 

0 
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and the Cloze test. Third, there is no theoretical reason to expect 
sex differences. 



criterion is due to a pecularity of this test or of the sample studied. 
What pecularity of the sample or the M.A.T. Reading test is at work 
here remains an open question. It is very possible that the difference 
will disappear under replication. If this difference is maintained 
under replication, however, it should be investigated further. 



Hypothesis 2: Children's skill in making a lexical analysis of the 

main verb of a sentence is positively related to their reading comprehension 



This hypothesis can be tested in two ways, The first way is 
through correlations between the sentence completion test and reading 
comprehension. The second way is by examining through t tests the relative 
performance of good and poor readers on the subset of items on the D.S.R.T. 
as discussed in Chapter III. 

/ 

Sentence Completion Test 

The first test of hypothesis 2 can be made by correlating the 
number of complement sentences produced by Ss on the Sentence Completion 

O 



It appears that the sex differences with the M.A.T. Reading 



Hypothesis 2 



skill. 
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test with reading comprehension skill# Hypothesis 2 predicts a positive 
and significant correlation# 

These correlations arc presented in Table 24. 

TABLE 24 

Correlations: Sentence Completion Test vs. Reading Comprehension (W=87) 





Cloze 


M.A,T. Reading 


X 


Sentence Completion 


.270* 


.191 


3.4:7 


(Number of Complement 








Sentences) 









* p<.05 



The results show confirmation for hypothesis 2, The correlation 
between the Sentence Completion test and the Cloze test is statistically 
significant but low# On the M.A.T. Reading it is non significant. 

A repeat of the analysis for male and female subgroups is presented 
in Table 25. 

TABLE 25 

Correlations: Sentence Completion vs. Reading Comprehension by Sex 



Sentence Completion 


Cloze 


M.A.T, Readinj 


! * 


S.D. 


N 


Males 


.389* 


#325* 


3.0*8 


1.847 


42 


Females 


.069 


,031 


3*800 


1.5)7 


45 



*p C .05 
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The hypothesis is confirmed for males but not for females. The 
significant correlation for the total group is due to the male subgroup. 

Item Subsets from the D.S.R.T. 

. . r. Jhe test of hypothesis 2 involves looking at the performance 

o; '; } uo u ; 'anu po~. readers on the subset of the D.S.R.T, items ‘ containing 
lexically simple and complex verbs. This involves dividing the group 
into good readers, i.e# upper 40% on the Cloze test, and poor readers, 
i*e. lower 40% on the Cloze test. Next a comparison of performance for 
each of these groups on the items containing the simple verbs, i.e. 
transitive and tell verbs versus items containing more complex verbs, 
complement and ask verbs, is made through the use of t tests. 

The expected performance is as follows: 

Simple Verbs Complex Verbs t-test 

Good Readers X ^ X 

(Upper 40% 

Cloze test) 

Poor Readers X ^ X t-test 

-'(Lower 40% 

-Cloze test) 

Total Group X > X t-test 

These predictions are based on the following line of reasoning: 

Total Group Simple verbs will be easier than complex verbs accoi'd- 
ing to the lexical analysis strategy for recovery of deep structure as 
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proposed by Fodor, Garrett and Bever 1968 and discussed In Chapter II. 

G ood Readers These readers tend to employ the lexical analysis 
strategy and thus will find simple verbs easier than complex verbs. 

Poor Readers The readers because of less skill at employing 
V lexical a -Jysis strategy will find the complex verbs and simple 
vcvbs equally difficult. 

Table 26 presents the results of this analysis. 



TABLE 26 
t-tests 

Simple vs. Complex Verbs by Reading Comprehension (Upper 40%/Lover 40%) 





Simple Verbs 
(Transitive) 


Complex Verbs 
(Complement) 






X 


S.D. 


N 


X 


S.D. 


N 


Diff 


t 


Prob. 

(one-tailed) 


High Cloze 
(Upper 40%) 


2.611 


.494 


36 


2.444 


.558 


36 


.167 


1.435 


^.10>.05 


Low Cloze 
(Lover 40%) 


1.800 


.933 


35 


1.771 


.877 


35 


.029 


0.128 


>.10 


Total Group 


2.218 


.(127 


87 


2.069 


.818 


87 


.149 


1.312 


<il0>.05 




Simple Verbs 
(Tell) 


Complex Verbs 
(Ask) 






X 


S.D. 


N 


X 


S.D. 


N 


Diff 


t 


, Prob. 
(one-tailed) 


High Cloze 
(Upper 40%) 


1.9)7 


.280 


36 


1.778 


.485 


36 


.139 


1.405 


<.10>.05 


Lew Cloze 
(Lover 40%) 


1.371 


.690 


35 


1.443 


.550 


35 


.229 


1.675 


'S' .06 


Total Group 


1 .644 


.590 


87 


1.488 


.605 


87 


.195 


2.557 


<,02 
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The predictions for the total group are supported in the case 
of the Tell-Ask verbs where the difference is in the predicted direction 
and this difference is significant. In the case of the Transitive and 
Complement verbs the difference is in the right direction but is not 
significant. In this case the prediction is not upheld. 

The predictions for the good readers are not supported. Although 
the transitive verbs are easier than the complement verbs and the Tell 
verbs are easier than the Ask verbs, the differences are not statistically 
significant. These differences do however approach the .05 level. 

The predictions for the poor readers are generally supported in 
that there are no statistically significant differences between simple 
and complex verbs. However for the poor readers the Tell-Ask difference 
is very close (p == .06) to being significant. This of course is contrary 
to the predictions of equal difficulty. 

In neither case, i.e. good readers or poor readers, are the 
predictions completely upheld by the data. 

In the case of the transitive and complement verbs the results 
are in the right direction but not significant. In the case of the 
Tell and Ask verbs the data does not support the hypotheses, so that 
overall the analysis of simple and complex verbs does not really support 
hypothesis 2. 

Hypothesis 2: Conclusions 

The results of the analysis offer only slim support for hypothesis 
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2. Skill at making a lexical analysis of the main verb of the sentence 
as measured in this study appears to be only slightly, if at all, 
related to reading comprehension. The results of the analysis show: 

! 

' 1. Skill at making a lexical analysis of the main verb of a 

sentence, as measured by the Sentence Completion test, shows a small 
but statistically significant relationship tc reading comprehension, as 
measured by the Cloze test, and no relationship when comprehension is 
measured by the M.A.T. Reading test. 

2. For males skill at making a lexical analysis of the main verb 
of sentences, as measured by the Sentence Completion test, is moderately 
related to reading comprehension, as measured by the Cloze test or the 
M.A.T. Reading 

3. For females skill at making a lexical analysis of the verb, 
as measured by the Sentence Completion Test, shows no relationship to 
reading comprehension, as measured by the Cloze test or the M.A.T. 
Reading. 

4. Skill at making a lexical analysis of the main verb of a 
sentence, as measured by the subset of paired items on the D.S.R.T., 

is not related to reading comprehension, as measured. by the Cloze test. 

Hypothesis 2: Discussion 

The results of the analysis of hypothesis 2 can only be described 
as mixed. The results of the first test of hypothesis 2, the Sentence 
Completion test, support the hypothesis but in a weak way. The second 
test of the hypothesis, the analysis of simple and complex verbs, shows 
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essentially negative results. 

The second test of hypothesis 2 deserves further discussion, 

Its negative results can be explained in two way3. First, the hypothesis 
could be wrong. If this is the case, then there is obviously no reason 
to expect positive results. A second explanation could be that the 
hypothesis is correct hut the. instruments used to measure the predicted 
effects are not sensitive enough to show the effects. The second pos- 
sibility seems plausible for several reasons. First the demonstration 
of the lexical analysis strategy as reported by Fodor et al. (1968) 
was a weak one. In other words the lexical analysis strategy is 
difficult to demonstrate and requires sensitive instruments and carefully 
controlled experimentation. Fodor used a reaction time criterion while 
the present study used an error criterion. In the study reported here 
the lexical analysis strategy was demonstrated for the total group only 
for the tell-ask verbs but not for the transitive-complement verbs. 

Thus the failure to replicate the Fodor et al. results, i.e. that complex 
verbs are more difficult to comprehend than simple verbs, leads one to 
believe that the measuring Instrument used, which was different from 
Fodor*s, was not sensitive enough to pick up the differences in processing. 
In short, It Is possible that the lack of positive results for hypothesis 
2 could be explained by the lack of sensitivity of the instruments measuring 
Ss* use of the lexical-analysis strategy. 
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CHAPTER VI 
Implications 

The recovery of deep structure has been identified in this 
paper as an important aspect of the reading comprehension process. The 
unambiguous nature of this concept renders it an excellent point of 
departure for further research in reading comprehension. This should 
in turn lead to nev instructional methods. 

Research Implication s^ 

Research into Ss' ability to recover deep structure should 
take the following direction. 

1. The present study should be replicated to determine if the 
results can be generalized beyond the sample studied. 

2. When this study is replicated, other measures of Ss’ ability 
to recover deep structure 6hould be developed to make certain 
that the results reported here are not due to pecularities 

of the D.S.R.T. rather than to skill in recovering deep 
structure. 

3. The developmental aspects of Ss 1 ability to recover deep structure 
should be investigated to determine how this skill is acquired. 

4. Research should be undertaken to investigate the strategies 
that S3 u£e in recovering the deep structure. This research 
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could follow three lines. First, other strategies, like 
the lexical analysis strategy, need to be developed. These 
strategies will probably come from the work in psycholinguistics. 
Second, research should be undertaken to distinguish which of 
these strategies are specific to reading and which are part 
of language comprehension in general. Finally, research 
should be undertaken to investigate the pertinence of these 
strategies to differences in reading comprehension skills. 

This research should take the form of studies aimed at 
determining whether or not differences in Ss 1 skill in utilizing 
various strategies differentiate good and poor readers. 

5. Instructional research should be undertaken to determine if 
the strategies that Ss use in recovering the deep structure 
of sentences are amenable to instruction. And if improvement 
in these strategies will improve reading comprehension. 

Instructional Implications 

Increased knowledge of the strategies Ss use in recovering deep 
structure should lead to the development of diagnostic tests to identify 
children who are having difficulty in recovering deep structure and to 
pinpoint the deficient aspect of the deep structure recovery process. 
Instructional procedures and materials could then be developed to remedy 
the particular problems identified by the diagnostic tests. It is 
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impossible to name these instructional procedures now since they will 
be determined by the types of strategies — at present unknown — that 
Ss use when recovering deep structure. 

In this author r s opinion it would be ill advised at present 
to develop instructional procedures and materials before more is known 
about these strategies. To do so would simply be to repeat the mistakes 
made in the past when research was applied to instruction (see Chapter I). 
It would be incorrect to assume that Ss’ skill in the recovery o c deep 
structure could be improved simply by providing them practice in this 
skill through materials like the D.S.R.T. Such an approach would be a 
typical example of the nonthcoret ical thinking which currently allows 
testing procedures to be used as prototypes for teaching procedures. 

This approach is wrong for two reasons. First there is no 
particular evidence that the measurement of a skill is a good model 
for instruction. Second this approach provides no way of helping students 
who can not do the task. 

It seems to this author the development of instructional procedures 
must wait until more is known about the actual strategies, not just the 
general skill, used in recovering deep structure. In this way instruction 
could be much more specific and directed at particular student deficiencies 
Thus increased knowledge of the strategies that Ss use in recovering deep 
structure could provide the key to instruction in this important skill. 

In summary, the findings of this study could lead In two directions 
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First they could lead to more research into children* s skill at recovering 
deep structure. Second it could lecd to the development of tests to 
identify students with deficiencies in this skill and to instructional 
procedures and materials to remedy these deficiencies. 
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Directions : 



Sample 1. 



Sample 2, 
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APPENDIX A 



Deep Structure Recovery Test (D.S.R.T.) 



Name 



Circle the sentence that has a different meaning, 

a, The boy hit the girl, 

b, The girl \:as hit by the boy, 

c, The boy vas hit by the girl, 

a. The fat girl looked at the short boy* 

b. The short boy looked at the fat girl, 

c. The fat girl was looked at by the short boy. 
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Name 



1* a. The boy gave a book to the girl. 

b. The book was given the girl by the boy. 

c. The book was given to the boy by the girl. 

2. a. What the boy would like is for the girl to leave. 

b. For the boy to leave is what the girl would like. 

c. What the girl would like is for the boy to leave. 

3. a. He painted the red house. 

b. He painted the house red. 

c. He painted the house that was red. 

4. a. The girl asked the boy when to leave. 

b. The girl asked the boy when she should leave. 

c. The girl asked the boy when he should leave. 

5. a. The girl who the boy hit fell down. 

b. The boy the girl hit fell down. 

c, The boy who the girl hit fell down. 
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Name 



6. a. The boy saw the paper that was folded. 

b. The boy saw the folded paper, 

c. The boy saw the paper folded. 

7. a. The fat girl loved the thin boy. 

b. The girl who is thin Joved the boy who is fat. 

c. The boy who is thin v?as loved by the girl who is fat. 

8. a. The boy saw his broken toy. 

b. The boy saw his toy that was broken. 

c. The boy saw his toy broken. 

9. a. That the girl was right was believed by the boy. 

b. What the girl believed was that the boy was right. 

c. The girl believed the boy to be right. 

10. a. The boy told the girl what to do. 

b. The boy told the girl what he should do. 

c. The boy told the girl what she should do. 
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11. a* The tall boy kicked the short girl. 

b. The boy who is tall kicked the girl who is short. 

c. The tall boy was kicked by the girl who is short. 

12. a. That the girl would go into the house was expected by the boy. 

b. What the girl expected was that the boy would go into the house. 

c. The girl expected the boy to go into the house. 

13. a. The boy begged the girl to tell the truth. 

b. What the boy begged the girl to do was tell the truth. 

c. To tell the truth was what the girl begged the boy to do. 

14. a. The fat girl bit the thin boy. 

b. The girl who is thin bit the boy who is fat. 

c. The boy who is thin was bitten by the girl who is fat. 

15. a. The girl taught the boy to use a pencil. 

b. What the boy taught the girl was to use a pencil. 

c. To use a pencil was what the girl taught the boy. 
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Name 



16. a. The wagon is easy for us to push. 

b. We push the wagon easily. 

c. The wagon pushes us easily. 

17. a. ,Thc girl ordered the boy to sit down. 

b. What the girl ordered th boy to do was sit down. 

c. To sit down was what the boy ordered the girl to do. 

18. a. The girl who the boy knew went home. 

b. The boy the girl knew went home. 

c. The boy who the girl knew vent home. 

19. a. The girl sees easily. 

b. The girl is easy to see. 

c. It is easy to see the girl. 

20. a. The man found his car stolen. 

b. The man found that his car was stolen. 

c. The man found his stolen car. 
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Name 



21* a. The boy told the girl when to Jeave. 

b. The boy told the girl when he should leave. 

C. The boy told the girl when she should leave, 

22. a. The boy sees that the girl is tall. 

b. That the girl is tall is seen by the boy. 

c. What the girl sees is that the boy is tall. 

23. a. The tall boy was liked by the girl who was short. 

b. The tall boy liked the short girl* 

c. The boy who is tall liked the girl who is short. 

24. a. What the girl wants is for the boy to find the ball. 

b. The boy wants the girl to find the ball. 

c. For the boy to find the ball is what the girl wants. 

> 

25. a. The buy asked the girl what she should do, 

b. The boy asked the girl what to do. 

c* The boy asked the girl what he should do. 
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APPENDIX B 

Item Statistics: D.S.R.T. (N=!03) 



Item 

Number 


Item Difficulty 
(Proportion correct) 


Item Validity 
(Item vs* Total 
score -Biserial r) 


i 


.845 


.510 


2. 


.709 


.654 


3 


.282 


,556 


4 


.563 


.387 


5 


.495 


.358 


6 


.485 


.648 


7 


.854 


.409 


8 


.544 


.683 


9 


.738 


.711 


10 


.786 


,549 


11 


.874 


.767 


12 


.709 


.573 


13 


.922 


.571 


14 


.845 


.568 


15 


.903 


.743 


16 


.903 


.703 


17 


.961 


.601 


18 


.631 


.436 


19 


.825 


.486 


20 


.767 


.646 


21 


.825 


.700 


22 


.825 


.842 


23 


.573 


.575 


24 


. .874 


.491 


25 

— ' - 


.515 


.632 
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APPENDIX C 

Cloze Tests of Comprehension 
Cloze Test #1. 



A nobleman and a met in a tavern. their 

lunch they ordered _____ . When it was brought, noble- 
man took a spoonful, _____ the soup was so that he 

burned his and tears came to eyes. The merchant 

asked was weeping. The was ashamed to admit 

_ had burned his ___ answered, ’’Sir, I once 

a brother who committed great crime, for which 

was hanged. I was of his death, and made me 

weep." The believed this story and to eat his 

soup. too burned his mouth, that he had tears 

his eyes. The nobleman it and asked the 

"Sir, why do you ?" The merchant, who now the 

nobleman had deceived , answered "My lord, I _ 

weeping because you were hanged together with your 
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Cloze Test 02. 

Moat ants hard workers and often from six 

o'clock In morning until ten o'clock night . 

The 1 b among the worker cuts _ that each one 

has certain amount to do. do not know how 

decide what each one to do, for they 

not talk. Some people ants follow each other 

their sense of spell. ______ often live to be year 

old, end some ^ been known to live or seven years. 

One they get their food from plant lice, which 

might call their cows. ants milk these "cows" 

tapping the lice gently a drop of honey 

out. Then they eat honey. Ants take very care 

of these plant and often they build covering 

over them so they will bo protected the rain. 
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Cloze Test #3. 

We all ready for the with our packs strapped 

securely. We rolluwed the nearly all day, except 

‘ we stopped to lunch the side if a cliff. 

By sundown we anxiously looking for a to canp 

for the We gathered n huge of eucalyptus 

branches to ready for our fire. supper we rolled 

up - our blankets near the and ve were soon 

We were awakened by queer sounds which my recognized 

as howls of jaguar. We hastily heaped branches 

on the fire was burning low. We our ukcleles and 

sang. means of much light noise we kept all 

away except an inquisitive and several rabbits and . 

We fell asleep toward and we were not again during 

our stay the canyon. 



O 

ERJC 



126 



115 



APPENDIX D 



Regression Analyses for each School 



School #1 



Remission Analysis: D.S.R.T., Word Knowledge 

Word Discrimination, I.Q. with Cloisc Test (N=39) 



Variable 


(1) 

O~order 

r 


(2) 

Beta 

Weight 


(1) * (2) 
X Unique + 
proportion 
common 
variance 


X 

Unique 

Variance 


Mult 

R 


% 

Variance 

Explained 


D.S.R.T. 


.75 4 


0.6237 


47.0 


26.5 


.791 


62.6 


Word Knowledge 


.584 


0.3308 


19.3 


03.3 




* 


Word L_ :> crimination 


.463 


-0.0786 


' -3.6 a 


00,0 






I.Q. 


.519 

i 


-0.0019 


-0. l a 


00.2 












% Total 
Unique 
Variance 


Sj 

"1 


% Unex- 
plained 
Variance 37.4 








X Common 
Variance 


32.6 







^Jord Discrimination and I.Q. act as suppressor variables thus decreasing 
the multiple R and consequently the amount of variance explained. 
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School #2 

Regression Analysis: D.S.R.T,, Wo id Knowledge 

Word Discrimination! I.Q. vlth Cloze Test (N=48) 



Variable 


(1) 

0-order 

r 


(2) 

Beta 

Weight 


i 

(1) X (2) 

% Unique -f 
proportion 
common 
variance 


% 

Unique 

Variance 


Mult 

R 


% 

Variance 

Explained 


D.S »R,T« 


.698 


. 49C 7 


34 ,3 


16.3 


. 797 


63.5 


Word Knowledge 


,658 


.1473 


09.7 


00.6 






Word Discrimination 


.614 


.3106 


19.1 


04,4 






I.Q* 


.473 


.0088 


00.4 


00.0 












%Total 

Unique 

Variance 


30,0 _ 


7» Unex- 
plained 
Variance 37.4 








%Common 

Variance 


32,6 
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